WORLD INTELLECTUAL PROPERTY ORGANIZATION 

International Bureau 



PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

A01H 5/00, 5/10, A23D 7/00, A23K 1/14, 
C07H 21/04, C07K 4/10, C12N 15/04, 
15/63, 15/82 



Al 



(11) International Publication Number: 



(43) International Publication Date: 



WO 99/02030 



21 January 1999 (2 K01. 99) 



(21) International Application Number: 

(22) International Filing Date: 



PCT/US98/ 14528 



10 July 1998 (10.07.98) 



(30) Priority Data: 

60/052.249 



11 July 1997 (11.07.97) 



US 



(71 ) Applicant (for all designated States except US): WASHING- 

TON STATE UNIVERSITY RESEARCH FOUNDATION 
[US/ US]; N.E. 1615 Eastgate Boulevard, Pullman. WA 
99164-1802 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): BOHLMANN, Joerg 
[DE/DE]; Sophienstrasse 21, D^)745 Jena (DE). STEELE, 
Christopher, L. [US/US]; 617 Campbell Street, Ardmore, 
OK 973401 (US). CROTEAU, Rodney, B. [US/US]; N.E. 
1835 Valley Road, Pullman, WA 99163 (US). 

(74) Agent: SHELTON, Dennis, K.; Christensen O'Connor Johnson 
& Kindness PLLC, Suite 2800, 1420 Fifth Avenue, Seattle, 
WA 98101 (US). 



(81) Designated States: AL, AM. AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, Fl, GB, GE, 
GH, GM, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO patent 
( GH, GM, KE, LS, MW. SD, SZ, UG, ZW), Eurasian patent 
(AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent 
(AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, 
LU, MC, NL. PT, SE), OAPI patent (BF, BJ, CF, CG, CI, 
CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: MONOTERPENE SYNTHASES FROM GRAND FIR (ABIES GRANDIS) 
(57) Abstract 

cDNAs encoding myrcene synthase, (-)-limonene synthase and (-)-pinene synthase from Grand fir (Abies grandis) haven been isolated 
and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ) ID NO:l; 
SEQ ID NO: 3 and SEQ ID NO: 5) are provided which code for the expression of myrcene synthase (SEQ ID NO: 2), (-)-pinene synthase 
(SEQ ID NO: 4) and (-)-limonene synthase (SEQ ID NO: 6) , respectively, from Grand fir (Abies grandis). In other aspects, replicable 
recombinant cloning vehicles are provided which code for myrcene synthase, (-)-limonene synthase and (-)-pinene synthase, or for a base 
sequence sufficiently complementary to at least a portion of myrcene synthase, (-)-limonene synthase or (-)— pinene synthase DNA or RNA 
to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected 
and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding myrcene synthase, (-)-limonene synthase or (-)-pinene 
synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant monoterpene 
synthases that may be used to facilitate their production, isolation and purification in significant amounts. Recombinant myrcene synthase, 
(-)-limonene synthase or (-)-pinene synthase may be used to obtain expression or enhanced expression of myrcene synthase, (-Mimonene 
synthase or (-)-pinene synthase in plants in order to enhance the production of monoterpenoids, or may be otherwise employed for the 
regulation or expression of myrcene synthase, (-)-limonene synthase and (-)-pinene synthase, or the production of their products. 
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MONOTERPENE SYNTHASES FROM GRAND FIR (Abies gnmdis) 

Field of the Invention 

The present invention relates to nucleic acid sequences which code for 
monoterpene synthases from gymnosperm plant species, in particular from Grand fir 
(Abies grandis), including limonene synthase, myrcene synthase, and pinene 
synthase, to vectors containing the sequences, to host cells containing the sequences, 
to plant seeds expressing the sequences and to methods of producing recombinant 
monoterpene synthases and their mutants. 

Background of the Invention 

Chemical defense of conifer trees against bark beetles and their associated 
fungal pathogens relies primarily upon constitutive and inducible oleoresin 
biosynthesis (Johnson, M.A., and Croteau, R. (1987) in Ecology and Metabolism of 
Plan, Lipids (Fuller. G., and Nes, W.D., eds.) pp. 76-91 , American Chemical Society 
Symposium Series 325, Washington, DC; Gijzen, M.. Lewinsohn, E., Savage, T.j", 
and Croteau. R.B. (1993) in Bioaclive Volatile Compounds from Plants (Teranishi! 
R.. Buttery. R.G., and Sugisawa, FI., eds.) pp. 8-22, American Chemical Society 
Symposium Series 525, Washington, DC). This defensive secretion is a complex 
mixture of monoterpene and sesquiterpene olefins (turpentine) and diterpene resin 
acids (rosin) that is synthesized constitutively in the epithelial cells of specialized 
structures, such as resin ducts and blisters or, in the case of induced oleoresin 
formation, in undifferentiated cells surrounding wound sites (Lewinsohn. E., Gijzen, 
M.. Savage, T..1., and Croteau, R. (1991) Plant Physiol. 96:38-43). The volatile 
fraction of conifer oleoresin, which is toxic to both bark beetles and their fungal 
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associates (Raffa. K.F.. Berryman, A.A.. Simasko. J.. Teal. \V.. and \Von« B L 
(1985) Environ Entomol. 14:552-556). may consist of up to 30 different 
monoterpenes (Lewinsohn, E., Savage, T.J., Gijzen. M. and Croteau. R. (1993) 
Phyiodwm. Anal. 4:220-225). including acyclic types (e.g.. myrcenc), monocyclic 
types (e.g.. limonene) and bicyclic types (e.g., pinenes) (FIGURE 1). Although the 
oleoresin is toxic, many bark beetle species nevertheless employ turpentine volatiles 
in host selection and can convert various monoterpene components into aggregation 
or sex pheromones to promote coordinated mass attack of the host (Gijzen. M.. 
Lewinsohn. E.. Savage, T.J., and Croteau. R.B. (1993) in Dioaaivc Volatile 
Compounds from Plants (Teranishi. R., Buttery, R.G., and Sugisawa. H., eds.) pp. 8- 
22. American Chemical Society Symposium Series 525, Washington. DC; Byers, 
-A. (.1995) in Chemical Ecology of Insects 2 (Carde, R.T., and Bell. W.J.. eds.) 
PP. 154-213. Chapman and Hall, New York). In Grand fir (Abies grandis), increased 
formation of oleoresin monoterpenes, sesquiterpenes and diterpenes is induced by 
bark beetle attack (Lewinsohn. E., Gijzen, M„ Savage, T.J., and Croteau, R. (1991) 
Plant Physiol. 96:38-43; Raffa, K.F., and Berryman, A. A. (1982) Can 
Entomol. 114:797-810; Lewinsohn, E., Gijzen, M., and Croteau, R. (1991) Plant 
Physiol. 96:44-49), and this inducible defense response is mimicked by mechanically 
wounding sapling stems (Lewinsohn, E., Gijzen, M., Savage, T.J., and Croteau. R. 
(1991) Plant Physiol. 96:38-43; Lewinsohn. E., Gijzen, M., and Croteau, R. (1991) 
Plant Physiol. 96:44-49; Funk, C, Lewinsohn, E., Stofer Vogel, B., Steele C, and 
Croteau, R. (1994) Plan, Physiol. 106:999-1005). Therefore, Grand f.r has been 
developed as a model system to study the biochemical and molecular genetic 
regulation of constitutive and inducible terpene biosynthesis in conifers (Steele, C. 
Lewinsohn. E., and Croteau, R. (1995) Proc. Natl. Acad. Sci. USA 92:4164-4168). 

Most monoterpenes are derived from geranyl diphosphate, the ubiquitous C 10 
intermediate of the isoprenoid pathway, by synthases which catalyze the divalent 
metal ion-dependent ionization (to 1. FIGURE 1) and isomerization of this substrate 
to enzyme-bound linalyl diphosphate which, following rotation about C2-C3, 
undergoes a second ionization (to 2, FIGURE 1) followed by cyclization to the 
cx-lerpinyl cation, the first cyclic intermediate en route to both monocyclic and 
bicyclic products (Croteau, R., and Cane, D.E. (1985) Methods Enzymol. 110:383- 
405; Croteau, R. (1987) Chan. Rev. 87:929-954) (FIGURE 1 ). Acyclic 
monoterpenes. such as myrcene, may arise by deprotonation of carbocations 1 or 2, 
whereas the isomerization step to linalyl diphosphate is required in the case of cyclic 
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types, such as limonene and pinen.es, which cannot be derived from geranyl 
diphosphate directly because of the geometric impediment of the trans-double bond 
at C2-C3 (Croleau, R., and Cane, D.E. (1985) Methods Enzymol. 110:383-405; 
Croteau, R. (1987) Chem. Rev. 87:929-954). Many monoterpene synthases catalyze 
the formation of multiple products, including acyclic, monocyclic and bicyclic types, 
by variations on this basic mechanism (Gambliel, H.. and Croteau, R. (1984) J. Biol. 
Chem. 259:740-748; Croteau, R., Satterwhite, D.M., Cane, D.E.. and Chang, C.C. 
(1988) J. Bivl. Chem. 263:10063-10071; Croteau, R., and Satterwhite, D.IvlM 1989) 
J. Biol. Chem. 264:15309-15315). For example, (-)-limonene synthase, the principal 
monoterpene synthase of spearmint (Mentha spicata) and peppermint (M x piperita), 
produces small amounts of myrcene, (-)-a-pinene and (-)-p-pinene in addition to the 
monocyclic product (Rajaonarivony, J.I.M., Gershenzon, J., and Croteau, R. (1992) 
Arch. Biochem. Biophys. 296:49-57; Colby, S.M., Alonso, W.R., Katahira, E.J. 
McGarvey. DJ.. and Croteau, R. (1993) ./. Biol Chem. 268:23016-23024. 
Conversely, six different inducible monoterpene synthase activities have been 
demonstrated in extracts of wounded Grand fir stem (Gijzen, M., Lewinsohn, E., and 
Croteau, R. ( 199 1 ) Arch. Biochem. Biophys. 289:267-273) indicating that formation 
of acyclic, monocyclic and bicyclic monoterpenes in this species involves several 
genes encoding distinct catalysts. The inducible (-)-pinene synthase has been 
purified (Lewinsohn, E., Gijzen, M., and Croteau, R. (1992) Arch. Biochem. 
Biophys. 293:167-173), and isotopically sensitive branching experiments employed 
to demonstrate that this enzyme synthesizes both (-)-a- and (-)-(3-pinene (Wagschal, 
K., Savage, T.J., and Croteau, R. (1991) Tetrahedron 47:5933-5944). 

Deciphering the molecular genetic control of oleoresinosis and examining 
structure-function relationships among the monoterpene synthases of Grand fir 
requires isolation of the cDNA species encoding these key enzymes. Although a 
protein-based cloning strategy was recently employed to acquire a cDNA for the 
major wound-inducible dilerpene synthase from Grand fir, abietadiene synthase 
(Funk. C. Lewinsohn, E., Stofer Vogel. B., Steele C, and Croteau, R. (1994) Plant 
Physiol. 106:999-1005; Lal'ever, R.E., Stofer Vogel. 13., and Croleau. R. (1994) 
Arch. Biochem. Biophys. 313:139-149; Stofer Vogel, B.. Wildung, M.R.. Vogel, G., 
and Croteau. R. (1996) J. Biol. Chem. 271:23262-23268). all attempts at the reverse 
genetic approach to cloning of Grand fir monoterpene synthases have failed (Steele. 
C. Lewinsohn, E., and Croteau, R. (1995) Proc. Natl. Acad. Sci. 92:4164- 
4168). As an alternative, a similarity-based PCR strategy was developed (Steele, C, 
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Lew in so hn, E., and Croteau. R. (1995) Proc. Nail. Acad. Sci. t7£4 92:4164-4168) 
that employed sequence information from terpene synthases of angiosperm origin, 
namely a monoterpene synthase, (-)-4S-limonene synthase, from spearmint (Mentha 
spicata, Lamiaceae) (Colby, S.M., Alonso, W.R., Katahira, E.J., McGarvey, D.J., and 
5 Croieau, R. (1993) J. Biol. Chem. 268:23016-23024), a sesquiterpene 
synthase, 5-ep/-aristolochene synthase, from tobacco (Nicotiana labacum. 
Solanaeeae) (Facchini, PJ.. and Chappell, J. (1992) Proc. Nail. Acad. Sci. 
USA 89:1 1088-1 1092), and a diterpene synthase, casbene synthase, from castor bean 
{Ricinus communis, Euphorbiaceae) (Mau, C.J.D., and West, C.A, (1994) Proc Natl. 
10 Acad Sci. USA 91:8497-8501). 

Monoterpenes have significant potential for cancer prevention and treatment. 
Monoterpenes such as limonene, perillyl alcohol, carvone, geraniol and farnesol not 
only reduce tumor incidence and slow tumor proliferation, but have also been 
reported to cause regression of established solid tumors by initiating apoptosis (Mills 
15 J J., Chari R.S., Boyer I.J., Gould M.N., Jirtle R.L., Cancer Res.* 55:979-983, 1995). 
Terpenes have activity against cancers such as mammary, colon, and prostate. 
Clinical trials are being pursued (Seachrist L, J. N1H Res. 8:43) in patients with 
various types of advanced cancers to validate the health benefits of dietary terpenes 
for humans. However, terpenes are present in Western diets at levels that are 
20 probably inadequate for any significant preventive health benefits. Daily 
supplementation of the diet with a terpene concentrate (10-20 g/day) would appear to 
be the most rational strategy for dietary therapy of diagnosed cases of cancer. This 
invention envisages the production of such nutritionally beneficial terpenes in 
vegetable oils consumed daily via the engineering of relevant genes from Grand fir 
25 into oil seed crop plants such as oil seed brassica (canola), soybean and corn. 

Brief Description of the Drawings 
The foregoing aspects and many of the attendant advantages of this invention 
will become more readily appreciated as the same becomes better understood by 
reference to the following detailed description, when taken in conjunction with the 
30 accompanying drawings, wherein: 

FIGURE 1 is a schematic representation depicting the mechanism for the 
conversion of geranyl diphosphate to myrcene, (-)-limonene, p-phellandrene, 
(-)-a-pinene and (-)-(3-pinene by monoterpene synthases from Grand fir. Formation 
of the monocyclic and bicyclic products requires preliminary isomerization of 
35 geranyl diphosphate to linalyl diphosphate. The acyclic product could be formed 
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from either geranyl diphosphate or Iinalyl diphosphate via carboeations 1 or 2. OPP 
denotes the diphosphate moiety. 

FIGURE 2 is a sequence comparison of plant terpene synthases. A three- 
letter designation (Tps) for the gene family is proposed with sub-groups (Tpsa 
through Tpsf) defined by a minimum of 40 % amino acid identity between members. 
The numbers in parenthesis are the references to the published sequences. 

FIGURE 3 depicts a GLC-MS analysis of the products of the recombinant 
protein encoded by AG2.2 (SEQ ID NO:l), the sequence of the protein encoded by 
clone AG2.2 being set forth in SEQ ID NO:2. The GLC profile of the total pentane- 
soluble products generated from geranyl diphosphate when incubated with a cell-free 
extract of E. coli XL1 -Blue/ P GAG2.2 is illustrated (a), as are the mass fragmentation 
patterns for the monoterpene product with R, = 12.22 min (b) and for authentic 
myrcene (c). 

FIGURE 4 depicts a GLC-MS analysis of the products of the recombinant 
protein encoded by AGS. 18 (SEQ ID NO:3), the sequence of the protein encoded by 
clone AG3. 18 (SEQ ID NO:3) being set forth in SEQ ID NO:4. The GLC profile of 
the total pentane-soluble products generated from geranyl diphosphate when 
incubated with a cell-free extract of E. coli XLl-Blue/pGAG3.18 is illustrated (a), as 
are the mass fragmentation patterns (selected ion mode) for the monoterpene 
products with R, = 1 1 .34 min (b), and R t = 13.37 min (d). and for authentic a-pinene 
(c) and authentic p-pinene (e). 

FIGURE 5 depicts a GLC-MS analysis of the products of the recombinant 
protein encoded by AGIO (SEQ ID NO:5), the sequence of the protein encoded by 
clone AG JO (SEQ ID NO:5) being set forth in SEQ ID NO:6. The GLC profile of 
the total pentane-soluble products generated from geranyl diphosphate when 
incubated with a cell-free extract of E. coli BL21(DE3)/pSBAG10 is illustrated (a), 
as are the mass fragmentation patterns for the principal monoterpene product with R, 
= 13.93 min (b) and for authentic limonene (c). 

Summary of the Invention 

In accordance with the foregoing, cDNAs encoding myrcene synthase, 
(-)-li.noncnc synthase and (-)-pincne synthase from Grand fir {Abies grandis) have 
been isolated and sequenced, and the corresponding amino acid sequences have been 
deduced. Accordingly, the present invention relates to isolated DNA sequences 
which code for the expression of myrcene synthase, such as the sequence designated 
SEQ ID NO: 1 which encodes myrcene synthase from Grand fir (Abies grandis), for 
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the expression of (-)-pinene synthase, such as the sequence designated SEQ ID NO:3, 
which encodes the (-)-pinene synthase from Grand fir (Abies grandis) and for the 
expression of (-)-limonene synthase, such as the sequence designated SEQ ID NO:5, 
which encodes (-)-limonene synthase from Grand fir {Abies grandis). In other 
aspects, the present invention is directed to replicable recombinant cloning vehicles 
comprising a nucleic acid sequence, e.g., a DNA sequence which codes for a 
myrcene synthase, (-)-limonene synthase or (-)-pinene synthase, or for a base 
sequence sufficiently complementary to at least a portion of DNA or RNA encoding 
myrcene synthase, (-)-limoncnc synthase or (-)-pincnc synthase to enable 
hybridization therewith (e.g., antisense RNA or fragments of DNA complementary to 
a portion of DNA or RNA molecules encoding myrcene synthase, (-)-limonene 
synthase or (-)-pinene synthase which are useful as polymerase chain reaction 
primers or as probes for any of the foregoing synthases or related genes). In yet other 
aspects of the invention, modified host cells are provided that have been transformed, 
1> transfected. infected and/or injected with a recombinant cloning vehicle and/or DNA 
sequence of the invention. Thus, the present invention provides for the recombinant 
expression of myrcene synthase, (-)-limonene synthase and (-)-pinene synthase, and 
the inventive concepts may be used to facilitate the production, isolation and 
purification of significant quantities of recombinant myrcene synthase, (-)-Iimonene 
synthase and (-)-pinene synthase (or of their primary enzyme products) for 
subsequent use, to obtain expression or enhanced expression of myrcene synthase, 
(-)-limonene synthase and (-)-pinene synthase in plants, microorganisms or animals, 
or may be otherwise employed in an environment where the regulation or expression 
of myrcene synthase, (-)-limoncnc synthase and (-)-pinene synthase is desired for the 
production of these synthases, or their enzyme products, or derivatives thereof. In 
another aspect, the present invention relates to manipulation of monoterpene 
production to enhance resistance to insects and/or accumulate nutritionally beneficial 
monoterpenes in oil seeds. 

Detaile d Description of the Preferred Embodiment 
30 As used herein, the terms "amino acid n and "amino acids 1 ' refer to all 

naturally occurring L-cx-amino acids or their residues. The amino acids are identified 
by either the single-letter or three-letter designations: 

Asp D aspartic acid He I isoleucine 

Thr T threonine Leu L leucine 

Ser S serine Tyr Y tyrosine 
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Glu 


E 


glutamic acid 


Phe 


F 


phenylalanine 


Pro 


P 


proline 


His 


H 


histidine 


Gly 


G 


glycine 


Lys 


K 


lysine 


Ala 


A 


alanine 


Arg 


R 


arginine 


Cys 


C 


cysteine 


Trp 


W 


tryptophan 


Val 


V 


valine 


Gin 


Q 


glutamine 


Mel 


M 


methionine 


Asn 


N 


asparagine 



As used herein, the term "nucleotide" means a monomeric unit of DNA or 
RNA containing a sugar moiety (pentose), a phosphate and a nitrogenous 
heterocyclic base. The base is linked to the sugar moiety via the glycosidic carbon 
(1' carbon of pentose) and that combination of base and sugar is called a nucleoside. 
The base characterizes the nucleotide with the four bases of DNA being adenine 
("A"), guanine ("G"), cytosine ("C") and thymine ("T"). Inosine ("I") is a synthetic 
base that can be used to substitute for any of the four, naturally-occurring bases (A, 
> C, G or T). The four RNA bases are A,G,C and uracil ("U"). The nucleotide 
sequences described herein comprise a linear array of nucleotides connected by 
phosphodiester bonds between the 3' and 5" carbons of adjacent pentoses. 

The term "percent identity" means the percentage of amino acids or 
nucleotides that occupy the same relative position when two amino acid sequences, 
0 or two nucleic acid sequences are aligned side by side. 

The term "percent similarity" is a statistical measure of the degree of 
relatedness of two compared protein sequences. The percent similarity is calculated 
by a computer program that assigns a numerical value to each compared pair of 
amino acids based on chemical similarity (e.g., whether the compared amino acids 
are acidic, basic, hydrophobic, aromatic, etc.) and/or evolutionary distance as 
measured by the minimum number of base pair changes that would be required to 
convert a codon encoding one member of a pair of compared amino acids to a codon 
encoding the other member of the pair. Calculations are made after a best fit 
alignment of the two sequences have been made empirically by iterative comparison 
of all possible alignments, (llenikoff, S. and HenikoiV, J.G., Proc. Nat 'I. Acad. Sci. 
USA 89:10915-10919, 1992). 

"Oligonucleotide" refers to short length single or double stranded sequences 
of deoxyribonucleotides linked via phosphodiester bonds. The oligonucleotides are 
chemically synthesized by known methods and purified, for example, on 
polyacrylamide gels. 
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1 he term "myrcene synthase" is used herein to mean an enzyme capable of 
generating multiple monoterpenes from geranyl diphosphate. The principal and 
characteristic monoterpene synthesized by myrcene synthase is myrcene. which 
constitutes at least about 60% of the monoterpene mixture synthesized by myrcene 
5 synthase from geranyl diphosphate. 

1 he term "(-)-limonene synthase" is used herein to mean an enzyme capable 
ol generating multiple monoterpenes from geranyl diphosphate. The principal and 
characteristic monoterpene synthesized by (-)-limonene synthase is (-)-limonene. 
which constitutes at least about 60% of the monoterpene mixture synthesized by 
1 0 (-)-limonene synthase from geranyl diphosphate. 

The term "(-)-pinene synthase" is used herein to mean an enzyme capable of 
generating multiple monoterpenes from geranyl diphosphate. The principal and 
characteristic monoterpene synthesized by (-)-pinene synthase is (-)-pinene, which 
comprises at least about 60% of the monoterpene mixture synthesized by (-)-pinene 
1 5 synthase from geranyl diphosphate. 

The abbreviation "SSPE" refers to a buffer used in nucleic acid hybridization 
solutions. The 20X (twenty times concentrate) stock SSPE buffer solution is 
prepared as follows: dissolve 175.3 grams of NaCl, 27.6 grams of NaH 2 P0 4 H 2 0 and 
7.4 grams of EDTA in 800 millilitres of H 2 0. Adjust the pH to pH 7.4 with NaOH. 
20 Adjust the volume to one liter with H 2 0. 

The terms "alteration", "amino acid sequence alteration", "variant" and 
"amino acid sequence variant" refer to monoterpene synthase molecules with some 
dillercnces in their amino acid sequences as compared to the corresponding, native, 
i.e., naturally-occurring, monoterpene synthases. Ordinarily, the variants will 
possess at least about 70% homology with the corresponding native monoterpene 
synthases, and preferably, they will be at least about 80% homologous with the 
corresponding, native monoterpene synthases. The amino acid sequence variants of 
the monoterpene synthases falling within this invention possess substitutions, 
deletions, and/or insertions at certain positions. Sequence variants of monoterpene 
synthases may be used to attain desired enhanced or reduced enzymatic activity, 
modified regiochemistry or stereochemistry, or altered substrate utilization or 
product distribution. 

Substitutional monoterpene synthase variants are those that have at least one 
amino acid residue in the native monoterpene synthase sequence removed and a 
different amino acid inserted in its place at the same position. The substitutions may 
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be single, where only one amino acid in the molecule has been substituted, or they 
may be multiple, where two or more amino acids have been substituted in the same 
molecule. Substantial changes in the activity of the monoterpene synthase molecules 
of the present invention may be obtained by substituting an amino acid with a side 
chain that is significantly different in charge and/or structure from that of the native 
ammo acid. This type of substitution would be expected to affect the structure of the 
polypeptide backbone and/or the charge or hydrophobicity of the molecule in the area 
of the substitution. 

Moderate changes in the activity of the monoterpene synthase molecules of 
the present invention would be expected by substituting an amino acid with a side 
chain thai is similar in charge and/or structure to that of the native molecule. This 
type of substitution, referred to as a conservative substitution, would not be expected 
to substantially alter either the structure of the polypeptide backbone or the charge or 
hydrophobicity of the molecule in the area of the substitution. 

Insertional monoterpene synthase variants are those with one or more amino 
acids inserted immediately adjacent to an amino acid at a particular position in the 
native monoterpene synthase molecule. Immediately adjacent to an amino acid 
means connected to either the a-carboxy or cc-amino functional group of the amino 
acid. The insertion may be one or more amino acids. Ordinarily, the insertion will 
consist of one or two conservative amino acids. Amino acids similar in charge 
and/or structure to the amino acids adjacent to the site of insertion are defined as 
conservative. Alternatively, this invention includes insertion of an amino acid with a 
charge and/or structure that is substantially different from the amino acids adjacent to 
the site of insertion. 

5 Deletional variants are those where one or more amino acids in the native 

monoterpene synthase molecules have been removed. Ordinarily, deletional variants 
will have one or two amino acids deleted in a particular region of the monoterpene 
synthase molecule. 

The terms "biological activity", "biologically active", "activity" and "active" 
refer to the ability of the monoterpene synthases of the present invention to convert 
geranyl diphosphate to a group of monoterpenes, of which myrcene is the principal 
and characteristic monoterpene synthesized by myrcene synthase, (-)-limonene is the 
principal and characteristic monoterpene synthesized by (-)-limonene synthase and 
(-)-pinene is the principal and characteristic monoterpene synthesized by (-)-pinene 
synthase. The monoterpenes produced by the monoterpene synthases of the present 
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invention are as measured in an enzyme activity assay, such as the assay described in 
Example 3. Amino acid sequence variants of the lerpene synthases of the present 
invention may have desirable altered biological activity including, for example, 
altered reaction kinetics, substrate utilization product distribution or other 
5 characteristics such as regiochemistry and stereochemistry. 

The terms "DNA sequence encoding", "DNA encoding" and "nucleic acid 
encoding" refer to the order or sequence of deoxyribonucleotides along a strand of 
deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order 
or ammo acids along the translated polypeptide chain. The DNA sequence thus 
|o codes for the amino acid sequence. 

The terms "replicable expression vector" and "expression vector" refer to a 
piece ul" DNA, usually double-stranded, which may have inserted into it a piece of 
loK-.iin DNA. Foreign DNA is defined as heterologous DNA. which is DNA not 
luiuullx found in the host. The vector is used to transport the foreign or 
15 hctoologous DNA into a suitable host cell. Once in the host cell, the vector can 
replicate independently of or coincidental with the host chromosomal DNA, and 
several copies of the vector and its inserted (foreign) DNA may be generated. In 
addition, the vector contains the necessary elements that permit translating the 
foreign DNA into a polypeptide. Many molecules of the polypeptide encoded by the 
20 foreign DNA can thus be rapidly synthesized. 

The terms "transformed host cell," "transformed" and "transformation" refer 
to the introduction of DNA into a cell. The cell is termed a "host cell", and it may be 
a prokaryolic or a eukaryotic cell. Typical prokaryotic host cells include various 
strains of E. coli. Typical eukaryotic host cells are plant cells, such as maize cells, 
25 yeast cells, insect cells or animal cells. The introduced DNA is usually in the form of 
a vector containing an inserted piece of DNA. The introduced DNA sequence may 
be from the same species as the host cell or from a different species from the host 
cell, or it may be a hybrid DNA sequence, containing some foreign DNA and some 
DNA derived from the host species. 
30 The following abbreviations are used herein: bp(s), base pair(s); DEAE, 

O-diethylaminoethyl; DTT. dilhiothreitol; EDTA. ethylenediaminetetraacetic acid; 
GC, gas chromatography; 1PTG, isopropyl-P-D-thiogalactopyranoside; LB, Luria- 
Bertani; Mopso, 3-(N-morpholino)-2-hydroxypropane-sulfonic acid; MS. mass 
spectrum/ spectrometry; nt(s), nucleotide(s); ORF, open reading frame; l'CR, 
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polymerase chain reaction; PVDF, polyvinylidenedifluoride; SDS, sodium dodecyl 
sulfate; Tris, tris(hydroxymethyl) aminomethane; UV, ultraviolet. 

In accordance with the present invention, cDNAs encoding myrcene synthase 
(SEQ ID NO:l), (-)-pinene synthase (SEQ ID NO:3) and (-)-limonene synthase 
(SEQ ID NO:5) from Grand fir (Abies grandis) were isolated and sequenced in the 
following manner. Based on comparison of sequences of limonene synthase from 
spearmint (Colby, S.M., Alonso, W.R., Katahira. E.J., McGarvey, D.J., and Croteau, 
K- (1W3) J.Biol. Chem. 268:23016-23024). 5-c 7 ;,-aristolochene synthase from 
tobacco (Facchini, P.J., and Chappell, J. (1992) Proc. Nail. Acad. Sci. 
USA 89:1 1088-1 1092), and casbene synthase from castor bean (Mau, C.J.D., and 
West. C.A. (1994) Proc. Nail. Acad. Sci. USA 91:8497-8501). four conserved regions 
were identified for which a set of consensus, degenerate primers were synthesized: 
Primer A (SEQ ID NO:7), Primer 13 (SEQ ID NO:8), Primer C (SEQ ID NO:9) and 
Primer D (SEQ ID NO:10). Primers A (SEQ ID NO:7), B (SEQ ID NO:8), and D 
(SEQ ID NO: 10) were sense primers, while Primer C (SEQ ID NO:9). was an 
antisense primer. Each of the sense primers, A (SEQ ID NO:7), B (SEQ ID NO:8) 
and D (SEQ ID NO: 10), was used for PCR in combination with antisense primer C 
(SEQ ID NO:9) by employing a broad range of amplification conditions. Analysis of 
the PCR reaction products by agarose gel electrophoresis revealed that only the 
combination of primers C (SEQ ID NO:9) and D (SEQ ID NO: 10) generated a 
specific PCR product of approximately 110 bps. 

The 110 bps PCR product was gel purified, l.gated into a plasmid, and 
transformed into E. coli XLl-Blue cells. Plasmid DNA was prepared from 41 
individual transformants and the inserts were sequenced. Four different insert 
sequences were identified, and were designated as probes 1 (SEQ ID NO:l 1). 2 (SEQ 
IDNO:12),4(SEQIDNO:13)and5(SEQIDNO:14). Probes 1 (SEQ ID NO" 11) 2 
(SEQ ID NO:12), 4 (SEQ ID NO:13) and 5 (SEQ ID NO:14) were used to screen a 
cDNA library made from mRNA extracted from wounded Grand fir stems, and the 
longest clone that hybridized to each of these probes was isolated and sequenced 
Thus, clone AGL28 (SEQ ID NO:15) is the longest cDNA clone that hybridized to 
probe I (SEQ ID NO:l 1), clone AG2.2 (SEQ ID NO:l) is the longest cDNA clone 
that hybridized to probe 2 (SEQ ID NO:12), clone AG4.30 (SEQ ID NO:17) is the 
longest cDNA clone that hybridized to probe 4 (SEQ ID NO: 13), and clone AG5.9 
(SEQ ID NO: 19) is the longest cDNA clone that hybridized to probe 5 (SEQ ID 
35 NO: 14). 
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Truncated clone AG 1.28 (SEQ ID NO: 15) resembled most closely in size and 
sequence (72% similarity, 49% identity) a diterpene cyclase, abietadiene synthase, 
from Grand fir. Clones AG4.30 (SEQ ID NO:I7) and AG5.9 (SEQ ID NO:19) 
encode sesquiterpene synthases. Sequence and functional analysis of clone AG2.2 
5 (SEQ ID NO:l) revealed that it encoded the monoterpene synthase, myrcene 
synthase. 

Alignment of the four new terpene synthase cDNA sequences AG 1.28 (SEQ 
ID NO:15), AG2.2 (SEQ ID NO:l), AG4.30 (SEQ ID NO:17) and AG5.9 (SEQ ID 
NO: 1 9) with that for abietadiene synthase (Stofer Vogel, B., Wildung, M.R., Vogel, 
10 G.. and Croteau, R. (1996) J. Biol. Chem. 271:23262-23268) allowed the 
identification of several, conserved sequence motifs. Two new sense PCR primers, 
primer E (SEQ ID NO:21) and primer F (SEQ ID NO:22) were designed based on 
the sequence of the conserved protein sequence motifs. A new antisense PCR 
primer, primer G (SEQ ID NO:23), was designed based on limited sequence 
i> information available from pinene synthase. The combination of primer E (SEQ ID 
NO:21) and primer G (SEQ ID NO:23) amplified a cDNA product of 1022 bps, 
which was designated as probe 3 (SEQ ID NO:24). 

Probe 3 (SEQ ID NO:24) was used to screen a cDNA library made from 
mRNA extracted from wounded Grand fir stems. Hybridization of 10 5 Grand fir 
/.ZAP II cDNA clones with probe 3 (SEQ ID NO:24) yielded two types of signals 
comprised of about 400 strongly positive clones and an equal number of weak 
positives, indicating that the probe recognized more than one type of cDNA. Thirty- 
lour of the former clones and eighteen of the latter were purified, the inserts were 
selected by size (2.0-2.5 kb), and the in vivo excised clones were partially sequenced 
from both ends. Those clones which afforded weak hybridization signals were 
shown to contain inserts that were either identical to myrcene synthase clone AG2.2 
(SEQ ID NO: I) or exhibited no significant sequence similarity to terpene synthases. 

Clones which gave strong hybridization signals segregated into distinct 
sequence groups represented by clone AG3JS (SEQ ID NO:3) and clone AGIO (SEQ 
ID NO:5). Both AG3. 18 (SEQ ID NO:3) and AGIO (SEQ ID NO:5) were subcloned 
into plasmid expression vectors and expressed in E. coli. When extracts of the 
induced cells were tested for terpene synthase activity with all of the potential prenyl 
diphosphate substrates, only geranyl diphosphate was utilized. Extracts from E. coli 
containing the AGIO (SEQ ID NO:5) expression construct converted geranyl 
diphosphate to the -4S enantiomer of limonene as the major product, indicating that 
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AGIO (SEQ ID NO:5) encodes (-)-limonene synthase. Similar analysis of the 
monoterpene products generated from geranyl diphosphate by cell-free extracts of 
E coli containing the AG3.18 (SEQ ID NO:3) insert ligated into an expression vector 
revealed the presence of a 42:58% mixture of a-pinene and p-pinene, the same 
5 product ratio previously described for the purified, native (-)-pinene synthase from 
Grand fn. These data indicate that AG3.18 (SEQ ID NO:3) encodes (-)-pinene 
synthase. 

The isolation of cDNAs encoding (-)-limonene synthase, (-)-pinene synthase 
and myrcene synthase permits the development of efficient expression systems for 
these functional enzymes; provides useful tools for examining the developmental 
regulation of monoterpene biosynthesis; permits investigation of the reaction 
mechanism(s) of these unusual, multiproduct enzymes, and permits the isolation of 
other (-)-limonene synthase, (-)-pinene synthase and myrcene synthases. The 
isolation of the (-)-limonene synthase, (-)-pinene synthase and myrcene synthase 
cDNAs also permits the transformation of a wide range of organisms in order to 
introduce monoterpene biosynthesis de novo, or to modify endogenous monoterpene 
biosynthesis. 

Although the (-)-limonene synthase, (-)-pinene synthase and myrcene 
synthase proteins set forth in SEQ ID NO:6, SEQ ID NO:4 and SEQ ID NO:2, 
respectively, direct the enzymes to plastids, substitution of the presumptive targeting 
sequence of each of these enzymes (SEQ ID NO:2, amino acids 1 to 61; SEQ ID 
NO:4, amino acids 1 to 61; SEQ ID NO:6, amino acids 1 to 66) with other transport 
sequences well known in the art (sec, e.g., vonlleijne ctal.. Eur. 
J. Biochem. 180:535-545, 1989; Stryer, Biochemistry, W.H. Freeman and Company, 
New York. NY, p. 769 [1988]) may be employed to direct the (-)-limonene synthase, 
(-)-pinene synthase and myrcene synthase to other cellular or extracellular locations. 

In addition to the native (-)-limonene synthase, (-)-pinene synthase and 
myrcene synthase amino acid sequences of SEQ ID NO:6, SEQ ID NO:4 and SEQ 
ID NO:2, respectively, sequence variants produced by deletions, substitutions, 
mutations and/or insertions are intended to be within the scope of the invention 
except insofar as limited by the prior art. The (-)-Iimonene synthase, (-)-pinene 
synthase and myrcene synthase amino acid sequence variants of this invention may 
be constructed by mutating the DNA sequences that encode the wild-type synthases, 
such as by using techniques commonly referred to as site-directed mutagenesis. 
Nucleic acid molecules encoding the monoterpene synthases of the present invention 
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can be mutated by a variety of PCR techniques well known to one of ordinary skill in 
the art. See. e.g., "PCR Strategies", M.A. lnnis, D.H. Gelfand and J.J. Sninsky, eds., 
1995. Academic Press, San Diego, CA (Chapter 14); "PCR Protocols: A Guide to 
Methods and Applications". M.A. lnnis, D.H. Gelfand, J.J. Sninsky and T.J. White, 
5 eds.. Academic Press. NY ( 1 990). 

By way of non-limiting example, the two primer system utilized in the 
Transformer Site-Directed Mutagenesis kit from Clonlech, may be employed for 
introducing site-directed mutants into the monoterpene synthase genes of the present 
invention. Following denaturation of the target plasmid in this system, two primers 
are simultaneously annealed to the plasmid; one of these primers contains the desired 
site-directed mutation, the other contains a mutation at another point in the plasmid 
resulting in elimination of a restriction site. Second strand synthesis is then carried 
out. tightly linking these two mutations, and the resulting plasmids are transformed 
into a „w,S strain of E. coli. Plasmid DNA is isolated from the transformed bacteria, 
restricted with the relevant restriction enzyme (thereby linearizing the unmutated 
plasmids). and then retransformed into E. coli. This system allows for generation of 
mutations directly in an expression plasmid, without the necessity of subcloning or 
generation of single-stranded phagemids. The light linkage of the two mutations and 
the subsequent linearization of unmutated plasmids results in high mutation 
efficiency and allows minimal screening. Following synthesis of the initial 
restriction site primer, this method requires the use of only one new primer type per 
mutation site. Rather than prepare each positional mutant separately, a set of 
''designed degenerate" oligonucleotide primers can be synthesized in' order to 
introduce all of the desired mutations at a given site simultaneously. Transformants 
can be screened by sequencing the plasmid DNA through the mutagenized region to 
identify and sort mutant clones. Each mutant DNA can then be restricted and 
analyzed by electrophoresis on Mutation Detection Enhancement gel (J.T. Baker) to 
confirm that no other alterations in the sequence have occurred (by band shift 
comparison to the unmutagenized control). 

The verified mutant duplexes in the pET (or other) overexpression vector can 
be employed to transform E. coli such as strain E. coli BL21(DE3)pLysS, for high 
level production of the mutant protein, and purification by standard protocols. The 
method of FAB-MS mapping can be employed to rapidly check the fidelity of mutant 
expression. This technique provides for sequencing segments throughout the whole 
protein and provides the necessary confidence in the sequence assignment. In a 



WO 99/02030 



PCT/US98/14528 



10 



mapping experiment of this type, protein is digested with a protease (the choice will 
depend on the specific region to be modified since this segment is of prime interest 
and the remaining map should be identical to the map of unmutagenized protein). 
The set of cleavage fragments is fractionated by microbore HPLC (reversed phase or 
ion exchange, again depending on the specific region to be modified) to provide 
several peptides in each fraction, and the molecular weights of the peptides are 
determined by FAB-MS. The masses are then compared to the molecular weights of 
peptides expected from the digestion of the predicted sequence, and the correctness 
of the sequence quickly ascertained. Since this mutagenesis approach to protein 
modification is directed, sequencing of the altered peptide should not be necessary if 
the MS agrees with prediction. If necessary to verify a changed residue, 
CAD-tandem MS/MS can be employed to sequence the peptides of the mixture in 
question, or the target peptide purified for subtractive Edman degradation or 
carboxypeptidase Y digestion depending on the location of the modification. 
15 In lhe design of a particular site directed mutagenesis, it is generally desirable 

to first make a non-conservative substitution (e.g., Ala for Cys, His or Glu) and 
determine if activity is greatly impaired as a consequence. The properties of the 
mutagenized protein are then examined with particular attention to the kinetic 
parameters of K m and k cat as sensitive indicators of altered function, from which 
changes in binding and/or catalysis per se may be deduced by comparison to the 
native enzyme. If the residue is by this means demonstrated to be important by 
activity impairment, or knockout, then conservative substitutions can be made, such 
as Asp for Glu to alter side chain length, Ser for Cys, or Arg for His. For 
hydrophobic segments, it is largely size that is usefully altered, although aromatics 
can also be substituted for alkyl side chains. Changes in the normal product 
distribution can indicate which step(s) of the reaction sequence have been altered by 
the mutation. Modification of the hydrophobic pocket can be employed to change 
binding conformations for substrates and result in altered regiochemistry and/or 
stereochemistry. 

Other site directed mutagenesis techniques may also be employed with the 
nucleotide sequences of the invention. For example, restriction endonuclease 
digestion of DNA followed by ligation may be used to generate deletion variants of 
(-)-hmonene synthase, (-)-pinene synthase and myrcene synthase, as described in 
section 15.3 of Sambrook et al. (Molecular Cloning: A Laboratory Manual 2nd Ed., 
Cold Spring Harbor Laboratory Press, New York, NY [1989]). A similar strategy 
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may be used to construct insertion variants, as described in section 15.3 of 

Sambrook et al., supra. 

Olinonucleotide-directed mutagenesis may also be employed for preparing 
substitution variants of this invention. It may also be used to conveniently prepare 
5 the deletion and insertion variants of this invention. This technique is well known in 
the art as descr ibed by Adelmanetal. (DA/1 2:183 [1983]); Sambrook et al., supra: 
"Current Protocols in Molecular Biology", 1991, Wiley (NY), F.T. Ausubel, R. 
Brent, R.E. Kingston, D.D. Moore, J.D. Seidman, J. A. Smith and K. Struhl, eds. 

Generally, oligonucleotides of at least 25 nucleotides in length are used to 
10 insert, delete or substitute two or more nucleotides in the (-)-limonene synthase, 
(-)-pinene synthase and myrcene synthase molecule. An optimal oligonucleotide will 
have 12 to 15 perfectly matched nucleotides on either side of the nucleotides coding 
for the mutation. To mutagenize wild-type (-)-limonene synthase, (-)-pinene 
synthase and myrcene synthase, the oligonucleotide is annealed to the single-stranded 
15 DNA template molecule under suitable hybridization conditions. A DNA 
polymerizing enzyme, usually the Klenow fragment of E. coll DNA polymerase I, is 
then added. This enzyme uses the oligonucleotide as a primer to complete the 
synthesis of the mutation-bearing strand of DNA. Thus, a heteroduplex molecule is 
formed such that one strand of DNA encodes the wild-type synthase inserted in the 
20 vector, and the second strand of DNA encodes the mutated form of the synthase 
inserted into the same vector. This heteroduplex molecule is then transformed into a 
suitable host cell. 

Mutants with more than one amino acid substituted may be generated in one 
of several ways. If the amino acids are located close together in the polypeptide 
25 chain, they may be mutated simultaneously using one oligonucleotide that codes for 
all of the desired amino acid substitutions. If however, the amino acids are located 
some distance from each other (separated by more than ten amino acids, for example) 
it is more difficult to generate a single oligonucleotide that encodes all of the desired 
changes. Instead, one of two alternative methods may be employed. In the first 
30 method, a separate oligonucleotide is generated for each amino acid to be substituted. 
The oligonucleotides are then annealed to the single-stranded template DNA 
simultaneously, and the second strand of DNA that is synthesized from the template 
will encode all of the desired amino acid substitutions. An alternative method 
involves two or more rounds of mutagenesis to produce the desired mutant. The first 
35 round is as described for the single mutants: wild-type (-)-limonene synthase, 
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(-)-pinene synthase and myrcene synthase DNA is used for the template, an 
oligonucleotide encoding the first desired amino acid substitution(s) is annealed to 
this template, and the heteroduplex DNA molecule is then generated. The second 
round of mutagenesis utilizes the mutated DNA produced in the first round of 
mutagenesis as the template. Thus, this template already contains one or more 
mutations. The oligonucleotide encoding the additional desired amino acid 
substitulion(s) is then annealed to this template, and the resulting strand of DNA now 
encodes mutations from both the first and second rounds of mutagenesis. This 
resultant DNA can be used as a template in a third round of mutagenesis, and so on. 

A gene encoding (-)-limonene. synthase, (-)-pinene synthase and myrcene 
synthase may be incorporated into any'organism (intact plant, animal, microbe, etc.), 
or cell culture derived therefrom, that produces geranyl diphosphate. A (-)-limonene 
synthase, (-)-pinene synthase and myrcene synthase gene may be introduced into any 
organism for a variety of purposes including, but not limited to: production of 
1 5 (-)-limoiiene synthase, (-)-pinene synthase and myrcene synthase, or their products; 
production or modification of flavor and aroma properties: improvement of defense 
capability, and the alteration of other ecological interactions mediated by myrcene, 
(-)-limonene, (-)-pinene, or their derivatives. 

Eukaryotic expression systems may be utilized for the production of 
20 (-)-limonene synthase, (-)-pinene synthase and myrcene synthase since they are 
capable of carrying out any required posttranslational modifications and of directing 
the enzymes to the proper membrane location. A representative eukaryotic 
expression system for this purpose uses the recombinant baculo virus, Auiographa 
calijormca nuclear polyhidrosis virus (AcNPV; M.D. Summers and G.E. Smith, A 
25 Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures 
[1986]; Luckowetal., Bio-technology 6:47-55 [1987]) for expression of the 
terpenoid synthases of the invention. Infection of insect cells (such as cells of the 
species Spodopiera frugiperda) with the recombinant baculoviruses allows for the 
production of large amounts of the monoterpenoid synthase proteins. In addition, the 
baculovirus system has other important advantages for the production of recombinant 
(-)-limonene synthase, (-)-pinene synthase and myrcene synthase. For example, 
baculoviruses do not infect humans and can therefore be safely handled in large 
quantities. In the baculovirus system, a DNA construct is prepared including a DNA 
segment encoding (-)-limonene synthase, (-)-pinene synthase and myrcene synthase 
and a vector. The vector may comprise the polyhedron gene promoter region of a 
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baculovirus, the baculovirus flanking sequences necessary for proper cross-over 
during recombination (the flanking sequences comprise about 200-300 base pairs 
adjacent to the promoter sequence) and a bacterial origin of replication which permits 
the construct to replicate in bacteria. The vector is constructed so that (i) the DNA 
5 segment is placed adjacent (or operably linked or "downstream" or "under the control 
of) to the polyhedron gene promoter and (ii) the promoter/monoterpene synthase 
combination is flanked on both sides by 200-300 base pairs of baculovirus DNA (the 
flanking sequences). 

To produce the monoterpene synthase DNA construct, a cDNA clone 
encoding the full length (-)-limonene synthase, (-)-pinene synthase and myrcene 
synthase is obtained using methods such as those described herein. The DNA 
construct is contacted in a host cell with baculovirus DNA of an appropriate 
baculovirus (that is, of the same species of baculovirus as the promoter encoded in 
the construct) under conditions such that recombination is effected. The resulting 
recombinant baculoviruses encode the full (-)-limonene synthase, (-)-pinene synthase 
and myrcene synthase. For example, an insect host cell can be cotransfected or 
transfected separately with the DNA construct and a functional baculovirus. 
Resulting recombinant baculoviruses can then be isolated and used to infect cells to 
effect production of the monoterpene synthase. Host insect cells include, for 
example. Spodoptera frugiperda cells, that are capable of producing a baculovirus- 
expressed monoterpene synthase. Insect host cells infected with a recombinant 
baculovirus of the present invention are then cultured under conditions allowing 
expression of the baculovirus-encoded (-)-limonene synthase, (-)-pinene synthase and 
myrcene synthase, (-)-limonene synthase, (-)-pinene synthase and myrcene synthase 
25 thus produced are then extracted from the cells using methods known in the art. 

Other eukaryolic microbes such as yeasts may also be used to practice this 
invention. The baker's yeast Saccharomyces cerevisiae, is a commonly used yeast, 
although several other strains are available. The plasmid YRp7 (Stinchcomb et al.. 
Nature 282:39 [1979]; Kingsman et al.. Gene 7:141 [1979]; Tschemper et al., Gene 
10:157 [1980]) is commonly used as an expression vector in Saccharomyces. This 
plasmid contains the trpl gene that provides a selection marker for a mutant strain of 
yeast lacking the ability to grow in tryptophan, such as strains ATCC No. 44,076 and 
PEP4-1 (Jones, Genetics 85:12 [1977]). The presence of the trpl lesion as a 
characteristic of the yeast host cell genome then provides an effective environment 
for detecting transformation by growth in the absence of tryptophan. Yeast host cells 
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are generally transformed using the polyethylene glycol method, as described by 
llinnen (Proc. Nail. Acad. Sci. USA 75:1929 [1978]). Additional yeast 
transformation protocols are set forth in Gietz et al., N.A.R. 20(17) 1425(1992); 
Reeves et al., FEMS 99(2-3 ): 1 93- 1 97, (1992). 

Suitable promoting sequences in yeast vectors include the promoters for 
3-phosphoglycerate kinase (Hitzeman et al.. ./. Biol. Chem. 255:2073 [1980]) or other 
glycolytic enzymes (Hessetal., J. Adv. Enzyme Reg. 7:149 [1968]; Holland et al., 
Biochemistry 17:4900 [1978]), such as enolase, glyceraldehyde-3-phosphate 
dehydrogenase. hexokinase, pyruvate decarboxylase, phosphofructokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, 
inoscphospluitc isomerase, phosphoglucose isomerase, and glucokinase. In the 
construction of suitable expression plasmids, the termination sequences associated 
with these genes are also ligated into the expression vector 3' of the sequence desired 
to be expressed to provide polyadenylation of the mRNA and termination. Other 
promoters that have the additional advantage of transcription controlled by growth 
conditions are the promoter region for alcohol dehydrogenase 2, isocytochrome C, 
acid phosphatase, degradative enzymes associated with nitrogen metabolism, and the 
aforementioned glyceraldehyde-3-phosphate dehydrogenase, and enzymes 
responsible for maltose and galactose utilization. Any plasmid vector containing 
yeast-compatible promoter, origin of replication and termination sequences is 
suitable. 

Cell cultures derived from multicellular organisms, such as plants, may be 
used as hosts to practice this invention. Transgenic plants can be obtained, for 
example, by transferring plasmids that encode (-)-limonene synthase, (-)-pinene 
synthase and myrcene synthase and a selectable marker gene, e.g., the kan gene 
encoding resistance to kanamycin, into Agrobactehum tumifaciens containing a 
helper Ti plasmid as described in Hoeckema et al., Ato/wre 303: 1 79-1 8 1 [1983] and 
culturing the Agrobaclerium cells with leaf slices of the plant to be transformed as 
described by Anetal., Plant Physiology 81 :301-305 [1986]. Transformation of 
cultured plant host cells is normally accomplished through Agrobaclerium 
lumifaciens, as described above. Cultures of mammalian host cells and other host 
cells that do not have rigid cell membrane barriers are usually transformed using the 
calcium phosphate method as originally described by Graham and Van der Eb 
( Virology 52:546 [1978]) and modified as described in sections 16.32-16.37 of 
Sambrook et al., supra. However, other methods for introducing DNA into cells 



19 



WO 99/02030 



PCT7US98/14528 



10 



15 



20 



25 



30 



such as Polybrene (Kawai and Nishizavva. Mai. Cell. Biol. 4:1 172 [1984]), protoplast 
fusion (Schaffner, Proc. Natl. Acad. Sci. USA 77:2163 [1980]), electroporation 
(Neumann etal.. EMBOJ. 1:841 [1982]), and direct microinjection into nuclei 
(Capecchi. Cell 22:479 [1980]) may also be used. Additionally, animal 
transformation strategies are reviewed in Monastersky G.M. and Robl, J.M., 
Strategies in Transgenic Animal Science, ASM Press, Washington, D.C., 1995' 
Transformed plant calli may be selected through the selectable marker by growing 
the cells on a medium containing, e.g.. kanamycm, and appropriate amounts of 
phytohormone such as naphthalene acetic acid and benzyladenine for callus and 
shoot induction. The plant cells may then be regenerated and the resulting plants 
transferred to soil using techniques well known to those skilled in the art. 

In addition, a gene regulating (-)-limonene synthase, (-)-pinene synthase and 
myrcene synthase production can be incorporated into the plant along with a 
necessary promoter which is inducible. In the practice of this embodiment of the 
invention, a promoter that only responds to a specific external or internal stimulus is 
luscd to the target cDNA. Thus, the gene will not be transcribed except in response 
to the specific stimulus. As long as the gene is not being transcribed, its gene 
product is not produced. 

An illustrative example of a responsive promoter system that can be used in 
the practice of this invention is the glutathione-S-transferase (GST) system in maize. 
GSTs are a family of enzymes that can detoxify a number of hydrophobic 
electrophilic compounds that often are used as pre-emergent herbicides 
(Weigand etal.. Plant Molecular Biology 7:235-243 11986]). Studies have shown 
that the GSTs are directly involved in causing this enhanced herbicide tolerance. 
This action is primarily mediated through a specific 1.1 kb mRNA transcription 
product. In short, maize has a naturally occurring quiescent gene already present that 
can respond to external stimuli and that can be induced to produce a gene product 
Dm gene has previously been identified and cloned. Thus, in one embodiment of 
Hits .mention, the promoter is removed from the GST responsive gene and attached 
to a (-)-limonene synthase, (-)-pinene synthase and myrcene synthase gene that 
previously has had its native promoter removed. This engineered gene is the 
combination of a promoter that responds to an external chemical stimulus and a gene 
responsible for successful production of (-)-limonene synthase, (-)-pinene synthase 
and myrcene synthase. 
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In addition to the methods described above, several methods are known in the 
art for transferring cloned DNA into a wide variety of plant species, including 
gymnosperms, angiosperms, monocots and dicots (see, e.g., Glick and 
Thompson, eds., Methods in Plant Molecular Biology, CRC Press, Boca Raton, 
Florida [1993]). Representative examples include electroporntion-facilitated DNA 
uptake by protoplasts (Rhodes et al.. Science 240(4849):204-207 [1988]); treatment 
of protoplasts with polyethylene glycol (Lyznik et al., Plant Molecular 
Biology 13:151-161 [1989]); and bombardment of cells with DNA laden 
miciop.ojecliles (Klein et al., Plant Physiol. 91:440-444 [1989] and Boynlon el al., 
Scen e 240(4858): 1534-1538 [1988]). Additionally, plant transformation strategies 
:iiul techniques are reviewed in Birch, R.G.. Ann Rev Plant Phys Plant Mol 
/W 48:297 (1997 ); Forester et al., Exp. Agric. 33:15-33 (1997). Minor variations 
make these technologies applicable to a broad range of plant species. 

Luch of these techniques has advantages and disadvantages. In each of the 
techniques, DNA from a plasmid is genetically engineered such that it contains not 
onls the gene of interest, but also selectable and screenable marker genes. A 
selectable marker gene is used to select only those cells that have integrated copies of 
the plasmid (the construction is such that the gene of interest and the selectable and 
screenable genes are transferred as a unit). The screenable gene provides another 
check for the successful culturing of only those cells carrying the genes of interest. A 
commonly used selectable marker gene is neomycin phosphotransferase II (NPT II). 
This gene conveys resistance to kanamycin, a compound that can be added directly to 
the growth media on which the cells grow. Plant cells are normally susceptible to 
kanamycin and, as a result, die. The presence of the NPT II gene overcomes the 
effects of the kanamycin and each cell with this gene remains viable. Another 
selectable marker gene which can be employed in the practice of this invention is the 
gene which confers resistance to the herbicide glufosinate (Basta). A screenable gene 
commonly used is the P-glucuronidase gene (GUS). The presence of this gene is 
characterized using a histochemical reaction in which a sample of putatively 
transformed cells is treated with a GUS assay solution. After an appropriate 
incubation, the cells containing the GUS gene turn blue. 

The plasmid containing one or more of these genes is introduced into either 
plant protoplasts or callus cells by any of the previously mentioned techniques. If the 
marker gene is a selectable gene, only those cells that have incorporated the DNA 
package survive under selection with the appropriate phytotoxic agent. Once the 



WO 99/02030 



PCT/US98/14528 



appropriate cells are identified and propagated, plants are regenerated. Progeny from 
the transformed plants must be tested to insure that the DNA package has been 
successfully integrated into the plant genome. 

Mammalian host cells may also be used in the practice of the invention. 
5 Examples of suitable mammalian cell lines include monkey kidney CV1 line 
transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney 
line 293S (Graham et ah, J. Gen. Virol 36:59 [1977]); baby hamster kidney cells 
(BHK, ATCC CCL 10); Chinese hamster ovary cells (Urlab and Chasin, Proc. Natl. 
Acad. Sci USA 77:4216 [1980]); mouse Sertoli cells (TM4, Mather, Biol 
10 Rcprod. 23:243 [1980]); monkey kidney cells (CVI-76, ATCC CCL 70); African 
green monkey kidney cells (VERO-76, ATCC CRL-1 587); human cervical 
carcinoma cells (HELA, ATCC CCL 2); canine kidney cells (MDCK, 
ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 1442); human lung 
cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); mouse 
15 mammary tumor cells (MMT 060562, ATCC CCL 51); rat hepatoma cells (HTC, 
MI.54. Baumannetal., J. Cell Bwl 85:1 [1980]); and TRI cells (Mather et al., 
Annals N.Y. Acad. Sci. 383:44 [1982]). Expression vectors for these cells ordinarily 
include (if necessary) DNA sequences for an origin of replication, a promoter located 
in front of the gene to be expressed, a ribosome binding site, an RNA splice site, a 
20 polyadenylation site, and a transcription terminator site. 

Promoters used in mammalian expression vectors are often of viral origin. 
These viral promoters are commonly derived from polyoma virus, Adenovirus 2, and 
most frequently Simian Virus 40 (SV40). The SV40 virus contains two promoters 
that are termed the early and late promoters. These promoters are particularly useful 
25 because they are both easily obtained from the virus as one DNA fragment that also 
contains the viral origin of replication (Fiers et al., Nature 273:1 13 [1978]). Smaller 
or larger SV40 DNA fragments may also be used, provided they contain the 
approximately 250-bp sequence extending from the Hindlll site toward the Bgll site 
located in the viral origin of replication. 
30 Alternatively, promoters that are naturally associated with the foreign gene 

(homologous promoters) may be used provided that they are compatible with the host 
cell line selected for transformation. 

An origin of replication may be obtained from an exogenous source, such as 
SV40 or other virus (e.g., Polyoma, Adeno, VSV, BPV) and inserted into the cloning 
35 vector. Alternatively, the origin of replication may be provided by the host cell 
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chromosomal replication mechanism. If the vector containing the foreign gene is 
integrated into the host cell chromosome, the latter is often sufficient. 

The use of a secondary DNA coding sequence can enhance production levels 
of (-)-limonene synthase, (-)-pinene synthase and myrcene synthase in transformed 
cell lines. The secondary coding sequence typically comprises the enzyme 
dihydrofolate reductase (DHFR). The wild-type form of DHFR is normally inhibited 
by the chemical methotrexate (MTX). The level of DHFR expression in a cell will 
vary depending on the amount of MTX added to the cultured host cells. An 
additional feature of DHFR that makes it particularly useful as a secondary sequence 
is that it can be used as a selection marker to identify transformed cells. Two forms 
ol DHFR arc available for use as secondary sequences, wild-type DHFR and MTX- 
resistant DHFR. The type of DHFR used in a particular host cell depends on whether 
the host cell is DHFR deficient (such that it either produces very low levels of DHFR 
cndogcnously, or it does not produce functional DHFR at all). DHFR-dcficient cell 
lines such as the CHO cell line described by Urlaub and Chasin, supra, are 
transformed with wild-type DHFR coding sequences. After transformation, these 
DHFR-deficient cell lines express functional DHFR and are capable of growing in a 
culture medium lacking the nutrients hypoxanthine, glycine and thymidine. 
Nontransformed cells will not survive in this medium. 

The MTX-resistant form of DHFR can be used as a means of selecting for 
transformed host cells in those host cells that endogenously produce normal amounts 
of functional DHFR that is MTX sensitive. The CHO-K1 cell line (ATCC 
No. CL61) possesses these characteristics, and is thus a useful cell line for this 
purpose. The addition of MTX to the cell culture medium will permit only those 
cells transformed with the DNA encoding the MTX-resistant DHFR to grow. The 
nontransformed cells will be unable to survive in this medium. 

Prokaryotes may also be used as host cells for the initial cloning steps of this 
invention. They are particularly useful for rapid production of large amounts of 
DNA. for production ol" single-stranded DNA templates used for site-directed 
mutagenesis, for screening many mutants simultaneously, and for DNA sequencing 
of the mutants generated. Suitable prokaryotic host cells include E. coli K12 
strain 94 (ATCC No. 31.446), E. coli strain W3110 (ATCC No. 27,325) E. coli 
XI 776 (ATCC No. 31,537). and E. coli B; however many other strains of E coli 
such as HB101, JM101. NM522, NM538, NM539, and many other species and 
genera of prokaryotes including bacilli such as Bacillus subtilis, other 
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enterobacteriaceae such as Salmonella typhimurium or Serratia marcesans, and 
various PxeuJomonas species may all be used as hosts. Prokaryotic host cells or 
other host cells with rigid cell walls are preferably transformed using the calcium 
chloride method as described in section 1.82 of Sambrook et al., supra. 
Alternatively, electroporation may be used for transformation of these cells. 
Prokaryote transformation techniques are set forth in Dower, W.J., in Genetic 
Engineering. Principles and Methods. 12:275-296. Plenum Publishing Corp., 1990; 
llanahan et al.. Meth. Enxymol.. 204:63 (1991). 

As a representative example, cDNA sequences encoding (-)-limonene 
s>nihuse. (-)-pinene synthase or myrcene synthase may be transferred to the 
( I lis),, -Tag pET vector commercially available (from Novagen) for overexpression in 
/. oh as heterologous host. This pET expression plasmid has several advantages in 
Ind, k\el heterologous expression systems. The desired cDNA insert is ligated in 
tramc to plasmid vector sequences encoding six histidines followed by a highly 
specific protease recognition site (thrombin) that are joined to the amino terminus 
codon of the target protein. The histidine "block" of the expressed fusion protein 
promotes very tight binding to immobilized metal ions and permits rapid purification 
of the recombinant protein by immobilized metal ion affinity chromatography. The 
histidine leader sequence is then cleaved at the specific proteolysis site by treatment 
of the purified protein with thrombin, and the (-)-limonene synthase, (-)-pinene 
synthase and myrcene synthase again purified by immobilized metal ion affinity 
chromatography, this time using a shallower imidazole gradient to elute the 
recombinant synthases while leaving the histidine block still adsorbed. This 
overexpression-purification system has high capacity, excellent resolving power and 
is fast, and the chance of a contaminating E coli protein exhibiting similar binding 
behavior (before and after thrombin proteolysis) is extremely small. 

As will be apparent to those skilled in the art, any plasmid vectors containing 
replicon and control sequences that are derived from species compatible with the host 
cell may also be used in the practice of the invention. The vector usually has a 
replication site, marker genes that provide phenotypic selection in transformed cells, 
one or more promoters, and a polylinker region containing several restriction sites for 
insertion of foreign DNA. Plasmids typically used for transformation of E. coli 
include pBR322, pUC18, P UC19, P UCI18, P UC119, and Bluescript M13, all of 
which are described in sections 1.12-1.20 of Sambrook et al., supra. However, many 
other suitable vectors are available as well. These vectors contain genes coding for 
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ampicillin and/or tetracycline resistance which enables cells transformed with these 
vectors to grow in the presence of these antibiotics. 

The promoters most commonly used in prokaryotic vectors include the 
p-lactamase (penicillinase) and lactose promoter systems (Chang et al. 
5 Nature 375:615 [1978]; Itakura et al., Science 198: 1056 [1977]; Goeddel et al., 
Nature 281:544 [1979]) and a tryptophan (trp) promoter system (Goeddel et al., 
NucL Acids Res. 8:4057 [1980]; EPO Appl. PubL No. 36,776), and the alkaline 
phosphatase systems. While these are the most commonly used, other microbial 
promoters have been utilized, and details concerning their nucleotide sequences have 
been published, enabling a skilled worker to ligate them functionally into plasmid 
vectors (see Siebenlist et al., Cell 20:269 [1980]). 

Many eukaryotic proteins normally secreted from the cell contain an 
endogenous secretion signal sequence as part of the amino acid sequence. Thus, 
proteins normally found in the cytoplasm can be targeted for secretion by linking a 
15 signal sequence to the protein. This is readily accomplished by ligating DNA 
encoding a signal sequence to the 5* end of the DNA encoding the protein and then 
expressing this fusion protein in an appropriate host cell. The DNA encoding the 
signal sequence may be obtained as a restriction fragment from any gene encoding a 
protein with a signal sequence. Thus, prokaryotic, yeast, and eukaryotic signal 
20 sequences may be used herein, depending on the type of host cell utilized to practice 
the invention. The DNA and amino acid sequence encoding the signal sequence 
portion of several eukaryotic genes including, for example, human growth hormone, 
proinsulin, and prealbumin are known (see Stryer, Biochemistry W.H. Freeman and 
Company, New York, NY, p. 769 [1988]), and can be used as signal sequences in 
appropriate eukaryotic host celts. Yeast signal sequences, as for example acid 
phosphatase (Arima et ah, Nuc. Acids 7^.11:1657 [1983]), a-factor, alkaline 
phosphatase and invertase may be used to direct secretion from yeast host cells. 
Prokaryotic signal sequences from genes encoding, for example, LamB or OmpF 
(Wong et al.. Gene 68:193 [1988]), MalE, PhoA, or beta-lactamase, as well as other 
30 genes, may be used to target proteins from prokaryotic cells into the culture medium. 

Trafficking sequences from plants, animals and microbes can be employed in 
the practice of the invention to direct the monoteipene synthase proteins of the 
present invention to the cytoplasm, endoplasmic reticulum, mitochondria or other 
cellular components, or to target the protein for export to the medium. These 
35 considerations apply to the overexpression of (-)-limonene synthase, (-)-pinene 
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synthase and myrcene synthase, and to direction of expression within cells or intact 
organisms to permit gene product function in any desired location. 

The construction of suitable vectors containing DNA encoding replication 
sequences, regulatory sequences, phenotypic selection genes and the monoterpene 
synthase DNA of interest are prepared using standard recombinant DNA procedures. 
Isolated plasmids and DNA fragments are cleaved, tailored, and ligated together in a 
specific order to generate the desired vectors, as is well known in the art (see. for 
example, Maniatis, supra, and Sambrook et al., supra). 

As discussed above, (-)-limonene synthase, (-)-pinene synthase and myrcene 
synthase variants are preferably produced by means of mulalion(s) that are generated 
using the method of site-specific mutagenesis. This method requires the synthesis 
and use of specific oligonucleotides that encode both the sequence of the desired 
mutation and a sufficient number ofadjace.it nucleotides to allow the oligonucleotide 
to stably hybridize to the DNA template. 

The foregoing may be more fully understood in connection with the 
following representative examples, in which "Plasmids" are designated by a lower 
case p followed by an alphanumeric designation. The starting plasmids used in this 
invention are either commercially available, publicly available on an unrestricted 
basis, or can be constructed from such available plasmids using published 
procedures. In addition, other equivalent plasmids are known in the art and will be 
apparent to the ordinary artisan. 

"Digestion", "cutting" or "cleaving" of DNA refers to catalytic cleavage of 
the DNA with an enzyme that acts only at particular locations in the DNA. These 
enzymes are called restriction endonucleases, and the site along the DNA sequence 
where each enzyme cleaves is called a restriction site. The restriction enzymes used 
in this invention are commercially available and are used according to the 
instructions supplied by the manufacturers. (See also sections 1 .60-1 .61 and 
sections 3.38-3.39 of Sambrook et al., supra.) 

"Recovery" or "isolation" of a given fragment of DNA from a restriction 
digest means separation of the resulting DNA fragment on a polyacrylamide or an 
agarose gel by electrophoresis, identification of the fragment of interest by 
comparison of its mobility versus that of marker DNA fragments of known molecular 
weight, removal of the gel section containing the desired fragment, and separation of 
the gel from DNA. This procedure is known generally. For example, see Lawn et al. 
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(Nucleic Acids Res. 9:61 03-61 14 [1982]), and Goeddel et al. (Nucleic Acids Res., 
supra). 

The following examples merely illustrate the best mode now contemplated 
for practicing the invention, but should not be construed to limit the invention. All 
5 literature citations herein are expressly incorporated by reference. 

EXAMPLE 1 

PCR-Bas ed Generation of Probes for Cloning Monotcrpene Synthases From 

Grand fir (Abies grandis) 
Substrates, Reagents and cDNA Library - [ 1 - 3 H]Geranyl diphosphate (250 
Ci/mol) (Croteau, R., Alonso, W.R., Koepp, A.E., and Johnson, M.A. (1994) Arch. 
Biochem. Biophys. 309: 1 84- 1 92), [l- 3 H]farnesyl diphosphate (125 Ci/mol) (Dehal, 
S.S and Croteau, R. (1988) Arch. Biochem. Biophys. 261:346-356) and 
lljgeranylgeranyl diphosphate (120 Ci/mol) (LaFever, R.E., Stofer Vogel, B., 
and Croteau, R. (1994) Arch. Biochem. Biophys. 313:139-149) were prepared as 
15 described previously. Terpenoid standards were from our own collection. All other 
biochemicals and reagents were purchased from Sigma Chemical Co. or Aldrich 
Chemical Co., unless otherwise noted. Construction of the aZAP II cDNA library, 
using inRNA isolated from wounded Grand fir sapling stems, was described 
previously (Stofer Vogel, B., Wildung, M.R., Vogel, G., and Croteau, R. (1996) 
20 J. Biol. Chem. 271:23262-23268). 

PCR-Based Probe Generation - Based on comparison of sequences of 
limonene synthase from spearmint (Colby, S.M., Alonso, W.R., Katahira, E.J., 
McGarvey, D.J., and Croteau, R. (1993) J.Biol. Chem. 268:2301 6- 
23024), 5-t7;/-aristolochene synthase from tobacco (Facchini, P.J., and Chappell, 
25 J. (1992) Proc. Natl. Acad. Sci. USA 89:11088-11092), and casbene synthase from 
castor bean (Mau, C.J.D., and West, C.A. (1994) Proc. Natl. Acad. Sci. 
(7^ 91:8497-8501), four conserved regions were identified for which a set of 
consensus degenerate primers were synthesized: Primer A (SEQ ID NO:7); Primer B 
(SEQ ID NO:8); Primer C (SEQ ID NO:9); Primer D (SEQ ID NO: 10). Primers A 
(SEQ ID NO:7), B (SEQ ID NO:8) and C (SEQ ID NO:9) have been described 
previously (Steele, C. Lewinsohn, E. and Croteau, R., Proc. Nat'l. Acad Sci USA, 
92: 4164-4168 (1995)); primer D (SEQ ID NO: 10) was designed based on the 
conserved amino acid sequence motif DD(T/1)(I/Y/F)D(A/V)Y(A/G)(SEQ ID 
NO:25) of the above noted terpene synthases (Colby, S.M., Alonso, W.R., Katahira, 
35 E.J., McGarvey, D.J., and Croteau, R. (1993) J.Biol. Chem. 268:23016-23024; 
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Facchini. P.J., and Chappell, J. (1992) Pruc. Natl. Acad. Sci. USA 89:11088-11092; 
Mau. C.J.D.. and West, C.A. (1994) Proc. Natl. Acad. Sci. USA 91:8497-8501). 

Each of the sense primers, A (SEQ ID NO:7), B (SEQ ID NO:8) and D (SEQ 
ID NO: 10). was used for PCR in combination with antisense primer C (SEQ ID 
NO:9) by employing a broad range of amplification conditions. PCR was performed 
in a total volume of 50 ul containing 20 mM Tris/HCl (tris(hydroxymethyl ) 
aminomethane/HCl, pH 8.4), 50 mM KC1, 5 mM MgCl 2 , 200 uM of each dNTP, 1-5 
uM of each primer, 2.5 units of Tacj polymerase (BRL) and 5 p.! of purified Grand fir 
stem cDNA library phage as template (1.5 x 10 9 pfu/ml). Analysis of the PCR 
reaction products by agarose gel electrophoresis (Sambrock, J., Fritsch, E.F.. and 
Maniaiis. 1. (1989) Molecular ( lonin^ A Laboratory Manual, 2nd Ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY) revealed that only the combination of 
primers C and D generated a specific PCR product of approximately 1 10 bps (base 
pairs). This PCR product was gel purified, ligated into pT7Blue (Novagen). and 
transformed into E. coli XL 1 -Blue cells. Plasmid DNA was prepared from 41 
individual transformants and the inserts were sequenced (DyeDeoxy Terminator 
Cycle Sequencing, Applied Biosystems). Four different insert sequences were 
identified, and were designated as probes 1 (SEQ ID NO:l 1), 2 (SEQ ID NO: 12), 4 
(SEQ IDNO:13)and5 (SEQ ID NO: 14). 

Subsequent isolation of four new cDNA species {AG 1.28 (SEQ ID NO: 15); 
AG 2.2 (SEQ ID NO:l); AG4.30 (SEQ ID NO: 17) and AGS. 9 (SEQ ID NO: 19))', 
encoding terpene synthases from Grand fir corresponding to probes 1 (SEQ ID 
NO:ll),2 (SEQ ID NO:12),4 (SEQ ID NO:13) and 5 (SEQ ID NO:14), 
respectively, allowed the identification of three additional conserved sequence 
elements which were used to design a set of three new PCR primers: Primer E 
(5'-GGI GA(A/G) A(A/C)(A/G) (A/G)T1 ATG GA(A/G) GA(A/G) GC-3')(SEQ ID 
NO:21); Primer F (5'-GA(A/G) (C/T)T1 CA(G/A) (C/T)TI (A/C/T)(C/GAr)l (A/C)GI 
TGG TGG-3')(SEQ ID NO:22) and Primer G (5'-CCA (A/G)TT IA(A7G) ICC 
(C/T)TT 1AC (A/G)TC-3')(SEQ ID NO:23). 

Degenerate primer E (SEQ ID NO:21) was designed to conserved element 
GE(K/T)(V/I)M(E/D)EA (SEQ ID NO:26) and degenerate primer F (SEQ ID NO:22) 
was designed to conserved element Q(F/Y/D)(I/L)(T/L/R)RWW ( SEQ ID NO:27) by 
comparing the sequences of five cloned terpene synthases from Grand fir: a 
monoterpene synthase corresponding to probe 2 (SEQ ID NO: 12), two sesquiterpene 
synthases corresponding to probe 4 (SEQ ID NO:13) and probe 5 (SEQ ID NO:14), 
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respectively, a previously described diterpene synthase (Stofer Vogel, B., Wildung, 
M.R.. Vogel, G., and Croteau, R. (1996) J. Biol. Chem. 271:23262-23268), and a 
truncated terpene synthase corresponding to probe 1 (SEQ ID NO:l 1). Degenerate 
primer G (SEQ ID NO:23) was designed according to the amino acid sequence 
DVIKG(F/L)NW (SEQ ID NO:28) obtained from a peptide generated by trypsin 
digestion of purified (-)-pinene synthase from Grand fir. Primers E (SEQ ID NO:21 ) 
and F (SEQ ID NO:22) were independently used for PGR amplification in 
combination with primer G (SEQ ID NO:23). with Grand fir stem cDNA library as 
template. The combination of primers E (SEQ ID NO:21) and G (SEQ ID NO:23) 
yielded a specific PCR product of approximately 1020 bps. This PCR product was 
ligated into pT7Blue and transformed into E coli XL 1 -Blue. Plasmid DNA was 
prepared from 20 individual transformants and inserts were sequenced from both 
ends. The sequence of this 1022 bp insert was identical for all 20 plasmids and was 
designated as probe 3 (SEQ ID NO:24). 

,:> EXAMPLE 2 

Screening a Wounded Grand fir Stem cDNA Library 

For library screening, 100 ng of each probe was amplified by PCR, gel 
purif,ed, randomly labeled with [a- 32 P]dATP (Feinberg, A.P., and Vogelstein, B. 
(1984) Anal. Diuchem. 137:266-267), and used individually to screen replica filters 
of 10 plaques of the wound-induced Grand fir stem cDNA library plated on E. coli 
LE392. Hybridization with probes 1 (SEQ ID NO:l 1), 2 (SEQ ID NO: 12), 4 (SEQ 
ID NO: 13) and 5 (SEQ ID NO: 14) was performed for 14 h at 65 °C in 3 x SSPE and 
0.1% SDS. Filters were washed three times for 10 min at 55 °C in 3 x SSPE with 
0.1% SDS and exposed for 12 h to Kodak XAR film at -70 °C. All of the XZAPII 
clones yielding positive signals were purified through a second round of 
hybridization (probe 1 (SEQ ID NO:l 1) gave 25 positives, probe 2 (SEQ ID NO: 12) 
gave 16 positives, probe 4 (SEQ ID NO: 13) gave 49 positives and probe 5 (SEQ ID 
NO. 14) gave 12 positi ves). 

I lybridization with probe 3 (SEQ ID NO:24) was performed as before, but the 
filters were washed three times for 10 min at 65 °C in 3 x SSPE and 0.1% SDS 
before exposure. Approximately 400 ^ZAPII clones yielded strong positive signals, 
and 34 of these were purified through a second round of hybridization at 65°C. 
Approximately 400 additional clones yielded weak positive signals with probe 3 
(SEQ ID NO:24), and 1 8 of these were purified through a second round of 
hybridization for 20 h at 45 °C. Purified XZAP II clones isolated using all five 
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probes were in vivo excised as Bluescript II SK(-) phagemids and transformed into 
E. coli XLOLR according to the manufacturer's instructions (Stratagene). The size of 
each cDNA insert was determined by PCR using T3 (SEQ ID NO;29) and T7 (SEQ 
ID NO:30) promoter primers and selected inserts (>1.5 kb) were partially sequenced 
5 from both ends. 

EXAMPLE 3 

(ii -anti fir Monotcrncne Synthase cDN A Expression in E. coli and Enzyme 

Assavs 

Except for cDNA clones AG3.18 (SEQ ID NO:3) and AG3.48 (SEQ ID 
10 NO:31), all of the partially sequenced inserts were either truncated at the 5'-end, or 
were out of frame, or bore premature stop codons upstream of the presumptive 
methionine start codon. For the purpose of functional expression, a 2023 bp insert 
fragment, extending from nucleotides 75 to 2097 of the sequence set forth in SEQ ID 
NO: 1, and a 191 1 bp insert fragment, extending from nucleotide 1 to nucleotide 1910 
15 of the sequence set forth in SEQ ID NO:3, were subcloned in frame into pGEX 
vectors (Pharmacia). A 2016 bp fragment extending from nucleotide 73 to 
nucleotide 2088 of the sequence set forth in SEQ ID NO:5 was subcloned in frame 
into the pSBETa vector (Schenk, P.M., Baumann, S., Mattes, R., and Steinbiss, H.-H. 
(1995) Biotechniques 19, 196-200). To introduce suitable restriction sites for 
20 subcloning. fragments were amplified by PCR using primer combinations 22-BamW\ 
(S'-CAA AG G GAT CC A GAA TGG CTC TGG-3')(SEQ ID NO:33) and2.2-/Vofl 
(5'-AGT AA G CGG CCG C TT TTT AAT CAT ACC CAC-3')(SEQ ID NO:34) with 
pAG2.2 insert (SEQ ID NO:l) as template, 3.1 8-£coRl (5'-CTG CAG GAA TTC 
GGC ACG AGC-3'XSEQ ID NO:35) and 3.18-SmaI (5'-CAT AGC CCC GGG CAT 
25 AGA TTT GAG CTG-3')(SEQ ID NO:36) with pAG3.18 insert (SEQ ID NO:3) as 
template, and 10-MfeI (5-GGC AGG AA C ATA TG G CTC TCC TTT CTA TCG- 
3')(SEQ ID NO:37) and \0-BamH\ (5'-TCT AGA ACT AGT GGATCC CCC GGG 
CTG CAG-3')(SEQ ID NO:38) with pAGlO insert (SEQ ID NO:5) as template. 

PCR reactions were performed in volumes of 50 ul containing 20 mM 
30 Tiis/llCl (pll8.8). 10 mM KCI, 10 mM (N11 4 ) 2 S0 4 ,2 mM MgS0 4 , 0.1% Triton 
X-100. 5 ug bovine serum albumin (BSA), 200 uM of each dNTP, 0.1 uM of each 
primer. 2.5 units of recombinant Pfu polymerase (Stratagene) and 100 ng plasmid 
DNA with the following program: denaturation at 94 °C, 1 min; annealing at 60 
°C, 1 min; extension at 72 °C, 3.5 min; 35 cycles with final extension at 72 °C, 5 
35 min. The PCR products were purified by agarose gel electrophoresis and used as 
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template for a secondary PCR amplification with the identical conditions in total 
volumes of 250 ul each. Products from this secondary amplification were digested 
with the above indicated restriction enzymes, purified by ultrafiltration and then 
ligated, respectively, into flawHI/Wo/I-digested pGEX-4T-2 to yield plasmid 
PGAG2.2, into EcoRl/Smal -digested pGEX-4T-3 to yield plasmid pGAG3.18, and 
into AVel/Z^/nHI-digested pSBETa to yield plasmid pSBAGlO; these plasmids were 
then transformed into E. coli XL 1 -Blue or E. coli BL21(DE3). 

For expression, bacterial strains E. coli XLOLR/pAG3.18, E. coli 
XLOLR/pAG3.48, E. coli XL 1 -Blue/pGAG2.2, E. coli XLl-Blue/pGAG3.18, and 
E. coli BL21(DE3)/pSBAG10 were grown to A 6UU = 0.5 at 37 °C in 5 ml of LB 
medium (Sambrock, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A 
Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY) supplemented with 100 ug ampicillin/ml or 30 ug kanamycin/ml as determined 
by the vector. Cultures were then induced by addition of 1 mM 
isopropyl-l-ihio-[3-D-galactopyranoside and grown for another 12 h at 20 °C. Cells 
were harvested by centrifugation (2000 x g, 10 min) and resuspended in either 1 ml 
monoterpene synthase assay buffer [50 mM Tris/HCl (pH 7.5), 500 mM KC1, 1 mM 
MnCl 2 . 5 mM dithiothreitol, 0.05% (w/v) NaHS0 3 and 10% (v/v) glycerol], 1 ml 
sesquiterpene synthase assay buffer [10 mM dibasic potassium phosphate, 1.8 mM 
monobasic potassium phosphate (pH 7.3), 140 mM NaCl, 10 mM MgCl 2 , 5 mM 
dithiothreitol, 0.05% (w/v) NaHS0 3 and 10% (v/v) glycerol], or 1 ml diterpene 
synthase assay buffer [30 mM Hepes (A-2-hydroxyethylpiperazine-/v"-2- 
ethanesullonic acid, pi 17.2), 7.5 mM MgCl 2 , 5 mM dithiothreitol. 10 uM MnCl 2 , 
0.05% (w/v) NaHSO> 3 and 10% (v/v) glycerol]. 

Cells were disrupted by sonication (Braun-Sonic 2000 with microprobe at 
maximum power for 15 seconds at 0-4°C), the homogenates were cleared by 
centrifugation (18,000 x g, 10 min), and 1 ml of the resulting supernatant was 
assayed for monoterpene synthase activity with 2.5 uM of [1 - 3 H]geranyl 
diphosphate, for sesquiterpene synthase activity with 3.5 uM [ 1 - 3 H]famesyl 
diphosphate, or for diterpene synthase activity with 5 uM [1 - 3 H]geranylgeranyl 
diphosphate following standard protocols (Croteau, R., and Cane, D.E. (1985) 
Methods Enzymol. 110:383-405; LaFever. R.E., Stofer Vogel, B., and Croteau, R. 
(1994) Arch. Biochem. Biophys. 313:139-149; Dehal, S.S., and Croteau, R. (1988) 
Arch. Biochcm. Biophys. 261:346-356). In the case of the monoterpene synthase and 
sesquiterpene synthase assays, the incubation mixture was overlaid with 1 ml pentane 
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to trap volatile products. In all cases, after incubation at 3 1°C for 2 h, the reaction 
mixture was extracted with pentane (3 x 1 ml) and the combined extract was passed 
through a 1.5 ml column of anhydrous MgSC) 4 and silica gel (Mallmckrodt 60 A) to 
provide the terpene hydrocarbon fraction free of oxygenated metabolites. The 
columns were subsequently eluted with 3 x 1 ml of ether to collect any oxygenated 
products, and an aliquot of each fraction was taken for liquid scintillation counting to 
determine conversion rate. 

Product Identification - To obtain sufficient product for analysis by radio- 
C.1..C (gas liquid chromatography), chiral capillary GLC and GLC-MS (mass 
spectrunv'spectrometry), preparative-scale enzyme incubations were carried out. 
Thus, the enzyme was prepared from 50 ml of cultured bacterial cells by extraction 
with 3 ml of assay buffer as above, and the extracts were incubated with excess 
substrate overnight at31°C. The hydrocarbon fraction was isolated by elution 
through MgSO,,-silica gel as before, and the pentane eluate was concentrated for 
evaluation by capillary radio-GLC as described (Croteau, R., and Satterwhite, D.M. 
(1990) ./. Chromatogr. 500:349-354), by chiral column capillary GLC (Lewinsohn, 
li.. Savage. T.J., Gijzcn, M. and Croleau, R. (1993) Phytochem. Anal. 4:220-225), 
and by combined GLC-MS [Hewlett-Packard 6890 GC-MSD with cool (40°C) on- 
column injection, detection via electron impact ionization (70 eV), He carrier at 0.7 
psi., column: 0.25 mm i.d. x 30 m fused silica with 0.25 urn film of 5MS (Hewlett- 
Packard) programmed from 35°C (5 min hold) to 230°C at 5°C/min]. 

EXAMPLE 4 
Sequence Analysis 

Inserts of all recombinant bluescript plasmids and pGEX plasmids were 
completely sequenced on both strands via primer walking and nested deletions 
(Sambrock, j., Fritsch, E.F.. and Maniatis, T. (1989) Molecular Cloning. A 
Laboratory Manual, 2d Ed.. Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY) using the DyeDeoxy Terminator Cycle Sequencing method (Applied 
B.osystems). Sequence analysis was done using the Wisconsin Package Version 9.0, 
Genetics Computer Group (GCG), Madison, Wl. 

EXAMPLE fi 
UNA Extraction and Northern Blottinp 

Grand fir sapling stem tissue was harvested prior to wounding or two days 
after wounding by a standard procedure (Gijzen, M.. Lewinsohn, E., and Croteau, R. 
(1991M,- C /,. Biochem. Biophys. 289:267-273). Total RK'A was isolated (Lewinsohn, 
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E., Steele. C.L., and Croteau, R. (1994) Plant Mol. Biol. Rep. 12:20-25 ) and 20 fig of 
RNA per gel lane was separated under denaturing conditions (Sambrock, J., Fritsch, 
E.F.. and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, 2d Ed.,' 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) and transferred to 
nitrocellulose membranes (Schleicher and Schuell) according to the manufacturer's 
protocol. To prepare hybridization probes, cDNA fragments of 1.4-1.5 kb were 
amplified by PCR from AG2.2 (SEQ ID NO: 1 ) with primer JB29 (5'-CTA CCA TTC 
CAA TAT CTG-3'XSEQ ID NO:39) and primer 2-8 (5'-GTT GGA TCT TAG AAG 
'ITC CC-3'XSEQ ID NO:40). from AG3. IS (SEQ ID NO:3) with primer 3-9 (5'-TTT 
CCA ITC CAA CCT CTG GG-3'XSEQ ID NO:41) and primer 3-1 1 (5"-CGT AAT 
G( iA AAG CTC TGG CG-3')(SEQ ID NO:42), and from AGIO (SEQ ID NO-5) with 
Pmner7-1 (5-CCT TAC ACQ CCT TTG GAT GG-3')(SEQ ID N043) and 
P".»« 7-3 (5'-TCT GTT GAT CCA GGA TGG TC-3')(SEQ ID NO:44). The probes 
"ere randomly labeled with [«- 32 P]dATP (Feinberg, A.P., and Vogelstein, B. (1984) 
Anal lUochem. 137:266-267). Blots were hybridized for 24 h at 55°C in 3 x SSPE 
and 0.1% SDS. washed at 55°C in 1 x SSPE and 0.1%, SDS and subjected to 
autoradiography as described above at -80°C for 24 h. 

EXAMPLE 6 

Cloning and Characterizatio n of Clones AG1.28 (SF.Q ID NO:15), AG2.2 
(SEP ID NO:l>, AG4.30 fSEP ID N Q:17) and AG 5.9 (SEP ID NO-iQ) 
Similarity-Based Cloning of Grand fu- Terpene Synthases - Grand fir has 
been developed as a model system for the study of induced oleoresin production in 
conifers in response to wounding and insect attack (Johnson, M.A., and Croteau, R. 
(1987) in Ecology and Metabolism of Plant Lipids (Fuller, G., and Nes, W.D., eds) 
pp. 76-91, American Chemical Society Symposium Series 325, Washington, 
DC; Gijzen, M., Lewinsohn, E., Savage, T.J., and Croteau, R.B. (1993) in Bioactive 
Volatile Compounds from Plants (Teranishi, R., Buttery, R.G., and Sugisawa, H., 
eds) pp. 8-22, American Chemical Society Symposium Series 525, Washington^ 
DC: Raffa. K. F., and Berryman. A.A. (1982) Can. Entomol. 114:797-810; Steele, C., 
Lewinsohn, E.. and Croteau. R. (1995) Proc. Natl. Acad. Sci. USA 92:41 64- 
4168; Lewinsohn, E., Gijzen, M., and Croteau, R. (1992) in Regulation of 
Isopentenoid Metabolism (Nes, W.D., Parish, E.J., and Trzaskos, J.M., eds) pp. 8-17, 
American Chemical Society Symposium Series 497, Washington. DC). The 
chemistry and biosynthesis of the oleoresin monoterpenes, sesquiterpenes and 
d.terpenes have been well defined (Lewinsohn, E., Savage, T.J., Gijzen, M., and 
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Croteau. R. (1993) Phylochem. Anal 4:220-225; Lewinsohn, E., Gijzen, M.. and 
Croteau, R. (1991) Plant Physiol 96:44-49; Funk, C, Lewinsohn, E., Stofer Vogel, 
B.. Steele C, and Croteau, R. (1994) Plant Physiol 106:999-1005; Gijzen, M., 
Lewinsohn, I-., and Croteau, R. (1991) Arch. Biociwm. Biophys. 289:267-273; 

5 Lewinsohn, E.. Gijzen, NL, and Croteau, R. (1992) Arch. Biochem. 
Biophys. 293:167-173; LaFever, R.E., Stofer Vogel, B., and Croteau, R. (1994) Arch. 
Biochem. Biophys. 313: 139-1 49; Funk, C, and Croteau, R. (1994) Arch. Biochem. 
Biophys. 308:258-266); however, structural analysis of the responsible terpene 
synthases as well as studies on the regulation of oleoresinosis require the isolation of 

10 cDNA species encoding the terpene synthases. Protein purification from conifers, as 
the basis for cDNA isolation, has been of limited success (Stofer Vogel, B., Wildung, 
M.R., Vogel, G., and Croteau, R. (1996) J. Biol Chem. 271:23262-23268) and thus 
iar has not permitted cloning of any of the monoterpene synthases from these species 
(Steele, C, Lewinsohn, E., and Croteau, R. (1995) Proc. Natl Acad Sci. 

15 USA 92:4164-4168). 

As a possible alternative to protein-based cloning of terpene synthases, a 
homology-based PCR strategy was proposed (Steele, C, Lewinsohn, E., and 
Croteau, R. (1995) Proc. Natl. Acad Sci. USA 92:4164-4168) that was founded upon 
the three terpene synthases of plant origin then available, a monoterpene synthase, 

20 (-)-4S-limonene synthase, from spearmint (Mentha spicata, Lamiaceae) (Colby, 
S.M, Alonso, W.R., Katahira, E.J., McGarvey, D.J., and Croteau, R. (1993) J. Biol 
Chem. 268:23016-23024), a sesquiterpene synthase, 5-<?p/-aristolochene synthase, 
from tobacco {Nicotiana tabacum, Solanaceae) (Facchini, P. J., and Chappell, 
J. (1992) Proc. Natl Acad Sci. USA 89:1 1088-1 1092), and a diterpene synthase, 

25 casbene synthase, from castor bean {Ricinus communis, Euphorbiaceae) (Mau, 
C.J.D., and West, C.A. (1994) Proc. Natl Acad. Sci. USA 91:8497-8501). Despite 
the taxonomic distances between these three angiosperm species and the differences 
in substrate utilized, reaction mechanism and product type of the three enzymes, a 
comparison of the deduced amino acid sequences identified several conserved 

30 regions that appeared to be useful for the design of degenerate PCR primers (see 
Example 1 ). Using cDN A from a wound-induced Grand fir stem library as template, 
PCR primers C (SEQ ID NO:9) and D (SEQ ID NO: 10) amplified products 
corresponding to four distinct sequence groups, all of which showed significant 
similarity to sequences of cloned terpene synthases of plant origin. The four 

35 different inserts were designated as probes 1 (SEQ ID NO:l 1), 2 (SEQ ID NO: 12), 4 
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(SEQ ID NO: 13) and 5 (SEQ ID NO: 14), and were employed for .solution of the 
corresponding cDNA clones by plaque hybridization. 

Screening of 10 5 cDNA phage plaques from the wounded Grand fir stem 
library, with each of the four probes, yielded a four-fold difference in the number of 
pos.tives, most likely reflecting different levels of expression of the corresponding 
genes. Size selected inserts (>1.5 kb) of purified and in vivo excised clones were 
partially sequenced from both ends, and were shown to segregate into four distinct 
groups corresponding to the four hybridization probes. Since all cDNAs 
corresponding to probes 1 (SEQ ID NO:ll),4 (SEQ ID NO:13) and 5 (SEQ ID 
NO: 14) were truncated at their 5'-ends, only inserts of the largest representatives of 
each group, clone AG1.28 (SEQ ID NO:15), clone AG2.2 (SEQ ID NOT) 
(apparently full length), clone AG4.30 (SEQ ID NO:17) and clone AG5.9 (SEQ ID 
NO: 19). were completely sequenced. Clone AG1.28 (SEQ ID NO:15)(2424 bps) 
includes an open reading frame (ORF) of 2350 nucleotides (nts) encoding 782 amino 
acids (SEQ ID NO:16); clone AG2.2 (SEQ ID NO:l)(2196 bps), includes an ORF 
of 1881 nts encoding 627 amino acids (SEQ ID NO:2); clone AG4.30 (SEQ ID 
NO: 17)(I967 bps) includes an ORF of 1731 nts encoding 577 amino acids (SEQ ID 
NO:18) and clone AG5.9 (SEQ ID NO:19)(1416 bps) includes an ORF of 1194 
nucleotides encoding 398 amino acids (SEQ ID NO:20). 

cDNA clones AG1.28 (SEQ ID NO: 15), AG2.2 (SEQ ID NOl) AG4 30 
(SEQ ID NO:17) and AG5.9 (SEQ ID NO:19) were compared pairwise with each 
other and with other cloned plant terpene synthases. Truncated clone AG 1.28 (SEQ 
ID NO: 15) resembled most closely in size and sequence (72% similarity, 49% 
identity) a diterpene cyclase, abietadiene synthase, from Grand fir (Stofer Vogel B 
Wildung, M.R., Vogel, G., and Croteau, R. (1996) J. Biol. Chem. 271-23262-23268.)' 
Clones AG4.30 (SEQ ID NO:17) and AG5.9 (SEQ ID NO:19, share 
approximately 80% similarity (60% identity) at the amino acid level, and are almost 
equally distant from both clone AGJ.28 (SEQ ID NO:15) and full-length clone 
AG2.2 (SEQ ID NO:l)(range of 65-70% similarity and 45-47% identity); the amino 
ac.d sequence similarity between AGJ.28 (SEQ ID NO: 15) and AG2.2 (SEQ ID 
NO:l) is 65% (41% identity). Considering the high level of homology between 
AG4.30 (SEQ ID NO:17) and AGS. 9 (SEQ ID NO:19), these comparisons suggest 
that the four new cDNAs, AG 1.28 (SEQ ID NO: 15), AG2.2 (SEQ ID NO l) AG 4 30 
(SEQ ID NO:17) and AG5.9 (SEQ ID NO:19), represent the three major subfamilies 
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of Grand fir terpene synthase genes encoding monoterpene synthases, sesquiterpene 
synthases and diterpene synthases. 

Identification of cDNA Clone A G2.2 (SEQ ID NO:l) as Myrcene Synthase 
- The P AG2.2 insert (SEQ ID NO:l I appeared to be a full-length clone encoding a 
5 protein of molecular weight 72,478 with a calculated pi at 6.5. The size of the 
translated protein encoded by AG2.2 (SEQ ID NO:l) (627 residues)(SEQ ID NO:2) 
is in the range of the monoterpene synthase preproteins for limonene synthase from 
spearmint (Colby, S.M., Alonso, W.R., Katah.ra, E.J., McGarvey, D.J., and Croteau, 
R. ( 1993] J. Biol. Chem. 268:23016-23024) and Perillafrutescens (Yuba, A„ Yazaki, 
H. K... Tahata. M.. Honda, G., and Croteau, R. (1996) Arch. Biochem. Biophys. 332:280- 
2X7). hut is about 240 amino acids shorter than the two gymnosperm diterpene 
svntha.sc preproteins for abietadiene synthase (Stofer Vogel, B., Wildung, M.R., 
Yogd. Ci„ and Croteau, R. (1996) ./. Biol. Chem. 271:23262-23268) and taxadiene 
syntlu.se (Wildung. M.R., and Croteau, R. (1996) J.Biol. Chem. 271:9201-9204). 
1 s Monoterpene and diterpene biosynthesis are compartmentalized in plastids whereas 
sesquiterpene biosynthesis is cytosolic (reviewed in Kleinig, H. (1989) Annu. Rev. 
Ham Physiol. Plant Mol. Biol. 40:39-53; Gershenzon, J., and Croteau, R. (1993) in 
Upui Metabolism in Plants (Moore, T.S. Jr., ed) pp. 339-388, CRC Press, Boca 
Raton. FL; McGarvey, D.J., and Croteau, R. (1995) Plant Cell 7, 1015-1026); thus, 
20 monoterpene and diterpene synthases are encoded as preproteins bearing an amino- 
lerminal transit peptide for import of these nuclear gene products into plastids where 
they are proteolytically processed to the mature forms (Keegstra, K., Olsen, J.J., and 
Theg, S.M. (.1989) Annu. Rev. Plant Physiol. Plant Mol. Biol. 40:471-501 ; 
von Heijne, G., Stepphulin, J., and Herrmann (1989) Eur. J. Biochem. 180:535-545). 
25 Both the size of the deduced protein and the presence of an //-terminal domain (of 60 
to 70 amino acids) with features characteristic of a targeting sequence [rich in serine 
residues (16-18%) and low in acidic residues (four Asp or Glu) (Keegstra, K., Olsen, 
J. J., and Theg, S.M. (.1989) Annu. Rev. Plant Physiol. Plant Mol. Biol. 40:471-501; 
von ilcijne. G., Stepphuhn. J., and Herrmann (1989) Eur. J. Biochem. 180:535-545)] 
30 suggest that AG2.2 (SEQ ID NO:l) encodes a monoterpene synthase rather than a 
sesquiterpene synthase or a dilerpene synthase. 

Since pAG2.2 contained the terpene synthase insert in reversed orientation, 
the ORF was subcloned in frame with glutathione S-transferase, for ultimate ease of 
purification (Bohlmann, J., DeLuca, V., Eilert, U., and Martin, W. (1995) Plant J. 
35 7:491-501: Bohlmann, J.. Lms, T., Martin, W., and Eilert, U. (1996) Plant Physiol. 
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1 1 1 :507-5 1 4), into pGEX-4T-2, yielding plasmid pGAG2.2. The recombinant fusion 
protein was expressed in E. coli strain XLl-Blue/pGAG2.2, then extracted and 
assayed for monoterpene synthase, sesquiterpene synthase and diterpene synthase 
activity using tritium labeled geranyl diphosphate, farnesyl diphosphate and 
geranylgeranyl diphosphate as the respective substrate. Enzymatic production of a 
terpene olefin was observed only with geranyl diphosphate as substrate, and the only 
product was shown to be myrcene by radio-GLC and GLC-MS comparison to an 
authentic standard (FIGURE 3). Bacteria transformed with pGEX vector containing 
the AG2.2 insert (SEQ ID NO:l) in antisense orientation did not afford detectable 
myrcene synthase activity when induced, and the protein isolated and assayed as 
nbove. A myrcene synthase cDNA has not been obtained previously from any 
source, although myrcene is a minor co-product (2%) of the native and recombinant 
limonene synthase from spearmint (Rajaonarivony, J.I.M., Gershenzon, J., and 
Croteau, R. (1992) Arch. Biuchem. Biophys. 296:49-57; Colby, S.M., Alonso, W.R., 
Katahira, E.J., McGarvey, D.J., and Croteau, R. (1993) ./ Biol. Chem. 268:23016- 
23024 ) and of several enzymes from sage (Croteau, R., and Satterwhite. D.M. (1989) 
J.Biol. Chem. 264:15309-15315). cDNA cloning and functional expression of 
myrcene synthase, which is one of several wound-inducible monoterpene synthase 
activities of Grand fir (Gijzen. M, Lewinsohn, E., and Croteau, R. (1991) Arch 
Biochem. Biophys. 289:267-273), demonstrates that this acyclic monoterpene is 
formed by a distinct enzyme and is not a co-product of another synthase. 

EXAMPLE 7 

Cloning and Characterization nf Clones AG3.18 (SEP II) NQ:3) Encoding 
fjj-Pinene Synthase and cDNA Cln ne AGIO (SF.Q ID NO:51 F.nnn,lin f , 

(-)-Limonene Synthase 
Identification of cDNA Clone AG3.18 (SEQ ID NO:3) as (-J-Pinene 
Synthase and cDNA Clone AGIO (SEQ ID NO:5) as (-)-Limonene Synthase - 
Alignment of the four new terpene synthase cDNA sequences {AG 1.28 (SEQ ID 
NO:15). AG2.2 (SEQ ID NO: I). AG-f.30 (SEQ ID NO: 17) and AG.5.9 (SEQ ID 
NO: 19)), and that for abietadiene synthase (Stofer Vogel, B., Wildung, M.R., Vogel 
G., and Croteau, R. (1996) J.Biol. Chem. 271:23262-23268), allowed the 
identification of several conserved sequence motifs among this enzyme family from 
Grand fir, which provided the foundation for an extended similarity-based cloning 
approach. Two new sense primers E (SEQ ID NO:21) and F (SEQ ID NO:22) were 
designed according to conserved sequence elements, whereas a degenerate antisense 
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primer G (SEQ ID NO:23) was designed based upon very limited amino acid 
sequence information from pinene synthase (see Example 1). Only the combination 
of primers E (SEQ ID NO:21) and G (SEQ ID NO:23) amplified a specific product 
of 533 bps, which was designated as probe 3 (SEQ ID NO:24). 
5 Hybridization of 10 5 Grand fir XZA? II cDNA clones with probe 3 (SEQ ID 

NO:24) yielded two types of signals comprised of about 400 strongly positive clones 
and an equal number of weak positives, indicating that the probe recognized more 
than one type of cDNA. Thirty-four of the former clones and 18 of the latter were 
purified, the inserts were selected by size (2.0-2.5 kb), and the in vivo excised clones 
10 were partially sequenced from both ends. Those clones which afforded weak 
hybridization signals were shown to contain inserts that were either identical to 
myrcene synthase clone AG2.2 (SEQ ID NO:l) or exhibited no significant sequence 
similarity to terpene synthases. Clone AG3.48 (SEQ ID NO:31) contained the 
myrcene synthase ORE in the correct orientation and in frame for expression from 
1 5 the Bluescript plasmid vector. This cDNA was functionally expressed in E. coli and 
the resulting enzyme was shown to accept only geranyl diphosphate as the prenyl 
diphosphate substrate and to produce myrcene as the exclusive reaction product. 
This finding with AG3.48 (SEQ ID NO:31) confirms that expression of AG2.2 (SEQ 
ID NO:l) as the glutathione S-transferase fusion protein from pGAG2.2 does not 
20 influence substrate utilization or product outcome of the myrcene synthase. 

CI ones which gave strong hybridization signals segregated into distinct 
sequence groups represented by clone AG3.JS (SEQ ID NO:3)(2018 bp insert with 
ORF of 1884 nt; encoded protein of 628 residues at 71,505 Da and pi of 5.5) and 
AGIO (SEQ ID NO:5)(2089 bp insert with ORE of 1911 nt; encoded protein of 637 
25 residues at 73,477 Da and pi of 6.4). AG3.18 (SEQ ID NO:3) and AGIO (SEQ ID 
NO:5) form a subfamily together with the myrcene synthase clone AG2.2 (SEQ ID 
NO:l) that is characterized by a minimum of 79% pairwise similarity (64% identity) 
at the amino acid level. Like myrcene synthase, both AG3.1S (SEQ ID NO:3) and 
AGIO (SEQ ID NO:5) encode JV-terminal sequences of 60 to 70 amino acids which 
30 are rich in serine (19-22% and 11-15%, respectively) and low in acidic residues (4 
and 2, respectively) characteristic of plastid transit peptides (Keegstra, K., Olsen, J. J., 
and Theg, S.M. (1989) Annu. Rev. Plant Physiol Plant Mol Biol 40:471-501 ; 
von Heijne, G., Stepphuhn, J., and Herrmann (1989) Eur. J. Biochem. 180:535-545). 

Plasmid pAG3.18 (SEQ ID NO:3) contained the presumptive terpene 
35 synthase ORF in frame for direct expression from the bluescript plasmid, whereas the 
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AG JO (SEQ ID NO:5) ORF was in reversed orientation. Both AG3 18 (SEQ ID 
NO:3) and AGIO (SEQ ID NO:5) were subcloned into expression vectors yielding 
plasmids pGAG3.18 and pSBAGlO. Recombinant proteins were expressed in 
bacterial strain E. coli XLOLR/pAG3. 1 8, E coli XLl-Blue/pGAG3.18 and E coli 
5 BL21(DE3)/ P SBAG10. When extracts of the induced cells were tested for terpene 
synthase activity with all of the potential prenyl diphosphate substrates, only geranyl 
diphosphate was utilized. Extracts from E. coli BL21(DE3)/ P SBAG10 converted 
geranyl diphosphate to limonene as the major product with lesser amounts of 
a-pinene, p-pinene and p-phellandrene, as determined bv radio-GLC and combined 
1 0 GLC-MS (FIGURE 5). Chiral phase capillary GLC on P-cyclodextrin revealed the 
hmonene product to be the (-)-4S-enantiomer and the pinene products to be the 
related (-)-( LS':5.V)-enantiomers. Although optically pure standards were not 
available lor the analysis, stereochemical considerations suggest that the minor 
product p- P hellandrene is also the mechanistically related (-)-(45)-antipode 
1> (Gambliel, II., and Croteau, R. (1984) J.Biol. Chem. 259:740-748" Croteau R 
Satterwh.te. D.M., Cane. D.E., and Chang, C.C. (1988) J. Biol. Chem. 263:10063- 
10071; Wagschal, K., Savage, T.J.. and Croteau, R. (1991) Tetrahedron 47:5933- 
5944; Croteau, R., Satterwhite, D.M., Wheeler, C.J., and Felton. N.M (1989) J. Biol 
Chem. 264:2075-2080; LaFever, R.E., and Croteau, R. (1993) Arch Biochem 

20 B.ophys. 301:361-366). Similar analysis of the monoterpene products generated 
from geranyl diphosphate by cell-free extracts of E. coli XLOLR/pAG3. 1 8 and 
E coli XLl-Blue/pGAG3.18 demonstrated the presence of a 42:58% mixture of 
a-pinene and P -pinene (FIGURE 4), the same product ratio previously described for 
the purified, native (-)-pinene synthase from Grand fir (Lewinsohn, E Gijzen M 

25 and Croteau, R. (1992) Arch. Biochem. Biophys. 293:167-173). Chiral phase 
capillary GLC confirmed the products of the recombinant pinene synthase to be the 
(-)-(lS:5S)-enantiomers, as expected. No other monoterpene co-products were 
detected with the recombinant (-)-pinene synthase, as observed previously for the 
native enzyme (Lewinsohn, E., Gijzen, M., and Croteau, R. (1992) Arch. Biochem 

30 Biophys. 293:167-173). 

Evidence for the formation of both a- and p-pinene by a single enzyme has 
been previously provided through co-purification studies, and differential inhibition 
and mactivation studies, as well as by isotopically sensitive branching experiments 
(Gambliel, IL, and Croteau, R. (1984) J. Biol. Chem. 259:740-748; Wagschal, K.C. 
33 Pyun, H.-J.. Coates, R.M., and Croteau, R. (1994) Arch. Biochem. Biophys. 308:477- 
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487: Wagschal, K., Savage, T.J.. and Croteau, R. (1991) Tetrahedron 47:5933- 
5944; Croteau. R., Wheeler, C.J., Cane. D.E., Ebert, R., an d Ha, H.-J (1987) 
Biochemistry 26:5383-5389). The cDNA cloning of pinene synthase provides the 
ultimate proof that a single enzyme forms both products. The calculated molecular 
weight of the (-)-pinene synthase deduced from AG3.18 (SEQ ID NO:3) is 
approximately 64,000 (excluding the putative transit peptide), which agrees well with 
the molecular weight of 63,000 established for the native enzyme from Grand fir by 
gel permeation chromatography and SDS-PAGE (Lewinsohn, E., Gijzen, M., and 
Croteau. R. ( 1 992) Arch. Biochem. Biophys. 293: 1 67-1 73). 

A limonene synthase cDNA has thus far been cloned only from two very 
closely related angiosperm species (Colby, S.M., Alonso, W.R., Katahira, E.J., 
McGarvey, D.J., and Croteau, R. (1993) J. Biol. Chem. 268:23016-23024; Yuba, a" 
Yazaki. K„ Tabata, M., Honda, G„ and Croteau, R. (1996) Arch Biochem 
Biophys. 332:280-287), and the isolation of a pinene synthase cDNA has not been 
reported before. Pinene synthase has previously received considerable attention as a 
major defense-related monoterpene synthase in conifers (Gijzen, M., Lewinsohn, E 
and Croteau, R. (1991) Arch. Biochem. Biophys. 289:267-273; Lewinsohn,' e', 
Gijzen, M., and Croteau, R. (1992) Arch. Biochem. Biophys. 293:167-173). In the 
Grand fir cDNA library, which was synthesized from mRNA obtained from wound- 
induced sapling stems, clones corresponding to pinene synthase are at least ten times 
more abundant than clones for myrcene synthase. This finding reflects the relative 
proportions of the induced levels of activities of these enzymes in Grand fir saplings- 
pinene synthase and limonene synthase are the major monoterpene synthase activities 
whereas the induced level of myrcene synthase activity is relatively low (Gijzen, M., 
Lewinsohn, E.. and Croteau, R. (1991) Arch. Biochem. Biophys. 289:267-273). The 
cDNAs for inducible monoterpene synthases provide probes for genetic and 
molecular analysis of olecresin-based defense in conifers. Northern blots 
(FIGURE 6) of total RNA extracted from non-wounded sapling stems and from 
stems two days after wounding (when enzyme activity first appears) were probed 
with cDNA fragments for AG2.2 (SEQ ID NO:l), AG3.18 (SEQ ID NO:3) and AGIO 
(SEQ ID NO:5), and thus demonstrated that increased mRNA accumulation for 
monoterpene synthases is responsible for this induced, defensive response in Grand 
Hr. The availability of cloned, defense-related monoterpene synthases presents 
several possible avenues for transgenic manipulation of oleoresin composition to 
.mprove tree resistance to bark beetles and other pests. For example, altering the 
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monoterpene content of oleoresin may chemically disguise the host and decrease 
insect aggregation by changing the levels of pheromone precursors or predator 
attractants. or lower infestation by increasing toxicity toward beetles and their 
pathogenic fungal associates (Johnson, M.A., and Croteau, R. (1987) in Ecology and 
Metabolism of Plant Lipids (Fuller, G., and Nes, W.D., eds) pp. 76-91, American 
Chemical Society Symposium Series 325, Washington, DC; Gijzen, M., Lewinsohn, 
E., Savage, T.J., and Croteau, R.B. (1993) in Bioactive Volatile Compounds from 
Plants (Teranishi. R., Buttery, R.G.. and Sugisawa, H., eds) pp. 8-22, American 
Chemical Society Symposium Series 525, Washington, DC; Byers, J.A. (1995) in 
Chemical Ecology of Insects 2 (Carde, R.T., and Bell, W.J.! eds) pp. 154-213, 
Chapman and Mall, New York). 

EXAMPLE 8 

Properties of the Recombin ant Monoterpene Synth ases Encoded hy 
cJ2NACl„nes A G2. 2 (SEP ID NO: J). A G3. IS (SEP ID NO- 7) and AGIO (SEP in 

NP:5) 

All three recombinant enzymes require Mn 2+ for activity, and Mg 2+ is 
essentially ineffective as the divalent metal ion cofactor. This finding confirms 
earlier results obtained with the native monoterpene synthases of Grand fir and 
lodgepole pine (Pinus contorta) (Lewinsohn, E., Gijzen, M., and Croteau. R. (1992) 
Arch. Biochem. Biophys. 293:167-173; Savage, T.J., Hatch, M.W., and Croteau, R. 
(1994) J. Biol. Chem. 269:4012-4020). All terpene synthases and prenyltransferases 
are thought to employ a divalent metal ion, usually Mg 2+ or Mn 2+ , in the ionization 
steps of the reaction sequence to neutralize the negative charge of the diphosphate 
leaving group (Croteau, R. (1987) Chem. Rev. 87:929-954; Cane, D.E. (1992) Ciha 
Found. Symp. Sen 171:163-167; Poulter, CD., and Rilling, H.C. (1981) in 
Biosynthesis of Isoprenoid Compounds (Porter, J.W., and Spurgeon, S.L., eds) 
Vol. 1, PP . 161-224, Wiley & Sons, New York), and all relevant sequences thus far 
obtained bear a conserved aspartate rich element (DDXXD)(SEQ ID NO:45) 
considered to be involved in divalent metal ion binding (Stofer Vogel, B., Wildung, 
M.R., Vogel. G., and Croteau, R. (1996) J. Biol. Chem. 271:23262-23268; Ashby] 
M.N.. and Edwards. P.A. (1990) ./. Biol. Chem. 265:13157-13164; Chen, A.' Kroon', 
P.A., and Poulter, D.C. (1994) Protein Sci. 3:600-607; Tarshis. L.C.. Yan. M.! 
Poulter, CD., and Sacchettini, J.C. (1994) Biochemistry 33: 1 087 1 -10877; Cane, 
D.E., Sohng, J.K., Lamberson, C.R., Rudnicki, S.M., Wu, Z.. Lloyd, M.D., Oliver! 
J.S., and Hubbard. B.R. (1994) ZWKvm.y/rv 33:5846-5857; Proctor, R.H., and 
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Holm. T.M. (1993) J. Biol. Chen, 268:4543-4548). In addition ,o this strict, general 
dependence on a divalent metal ion. the monoterpene synthases of conifers are 
unique in their further requirement for a monovalent cation (K + ), a feature that 
distinguishes the gymnosperm monoterpene synthases from their counterparts from 
angiosperm species and implies a fundamental structural and/or mechanistic 
difference between these two families of catalysts (Savage. T.J.. Hatch M W and 
Croteau, R. (1994) J. Biol. C/rc*. 269:4012-4020). All three recombinant 
monoterpene synthases depend upon K + , with maximum activity achieved at 
approximately 500 mM KCI. A requirement for K + has been reported lor a number 
of d.iferent types of enzymes, including those that catalyze phosphoryl cleavage or 
transfer reactions (Suelter, C.H. (1970) Science 168:789-794) such as Hsc70 ATPase 
(W.lbanks, S.M.. and McKay, D.B. (1995) J. Biol. Chem. 270:2251-2257) The 
crystal structure of bovine Hsc70 ATPase indicates that both Mg 2+ and K + interact 
directly with phosphate groups of the substrate and implicates three active she 
aspartate residues in Mg + and K + binding (Wilbanks, S.M., and McKay D B (1995) 
J. Biol. Chem. 270:2251-2257), reminiscent of the proposed role of the conserved 
DDXXD (SEQ ID NO:45) motif of the terpene synthases and prenyltransferases in 
divalent cat.on binding, a function also supported by recent site directed mutagenesis 
(Marrero, P.P., Poulter, CD., and Edwards, P.A. (1992) ./. Biol. Chen, 267:21873- 
21878; Joly. A., and Edwards, P.A. (1993) J. Biol. Chem. 268: 26983-26989- Song 
U and Poulter, CD. (1994) Proc. Natl. Acad. Sci. U.S.A. 91:3044-3048- Kovama' 
T.. Taj.ma. M„ Sano, H., Doi, T., Koike-Takeshha, A., Obata, S., Nishino T and 
Ogura, K. (1996) Biochemistry 35:9533-9538) and by X-ray structural analysis 
(Tarshis, L.C., Yan, M.. Poulter, CD., and Sacchettini, JC (1994) 
Biochemistry 33:10871-10877) of farnesyl diphosphate synthase. 

cDNA cloning and functional expression of the myrcene, limonene and 
pmene synthases from Grand fir represent the first example of the isolation of 
muluple synthase genes from the same species, and provide tools for evaluation of 
structure-function relationships in the construction of acyclic, monocyclic and 
b.cychc monoterpene products and for detailed comparison to catalysts from 
Phvlogenehcally distant plants that carry out ostensibly identical reactions (Gambliel 
H.. and Croteau, R. (1984) J.Biol. Chem. 259:740-748; Rajaonarivony JIM 
Gershenzon, J., and Croteau, R. (1992) Arch. Biochem. Btophys. 296 49-57- Colby' 
S.M., Alonso, W.R.. Katahira, E.J., McGarvey, D.J., and Croteau, R. (1993) J Biol. 
Chem. 268:23016-23024; Adam, K,P., Crock, J., and Croteau, R. (1996) Arch 
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Diochem. Biophys. 332:352-356). The recent acquisition of cDNA isolates encoding 
sesquiterpene synthases and diterpene synthases (Stofer Vogel, B., Wildung, M.R., 
Vogel, G., and Croteau, R. (1996) J. Biol. Chem. 271:23262-23268) from Grand fir 
should, together with the monoterpene synthases, also permit addressing the 
structural basis of chain-length specificity for prenyl diphosphate substrates in this 
family of related enzymes. 

EXAMPLE 9 

Se q u ence Comparison of Cloned Monoterpene Synthases 

Previous studies based on substrate protection from inactivation with 
selective amino acid modifying reagents have implicated functionally important 
cysteine, histidine and arginine residues in a range of different monoterpene 
synthases (Rajaonarivony. J.I.M., Gershenzon, J., and Croteau, R. (1992) Arch. 
Biochem. Biophys. 296:49-57; Lewinsohn, E., Gijzen, M., and Croteau, R. (1992) 
Arch. Biochem. Biophys. 293:167-173; Savage, T.J., Hatch, M.W., and Croteau. R. 
(1994) J.Biol. Chem. 269:4012-4020; Rajaonarivony, J.I.M., Gershenzon, J.. 
Miyazaki, J., and Croteau, R. (1992) Arch. Biochem. Biophys. 299:77-82; Savage! 
T.J.. Ichii, H., Hume, S.D., Little, D.B., and Croteau, R. (1994) Arch. Biochem. 
Biophys. 320:257-265 ). Sequence alignment of 21 terpene synthases of plant origin 
(Colby, S.M., Alonso, W.R., Katahira, E.J., McGarvey, D.J., and Croteau, R. (1993) 
J Biol. Chem. 268:23016-23024; Stofer Vogel, B., Wildung, MR., Vogel, G , and 
Croteau, R. (1996) J Biol. Chem. 271:23262-23268; Facchini, P.J.. and Chappell, 
J. (1992) Proc. Natl. Acad. Sci. USA 89:1 1088-1 1092; Mau, C.J.D., and West. C.A. 
(1994) Proc. Nad. Acad. Sci. USA 91 :8497-8501 ; Yuba, A., Yazaki, K.. Tabata, M., 
Honda, G., and Croteau, R. ( 1 996) Arch. Biochem. Biophys. 332:280-287; Wildung, 
M.R., and Croteau, R. (1996) J. Biol. Chem. 271:9201-9204; Yamaguchi,' S., Saito! 
T., Abe, H., Yamane, H„ Murofushi, N., and Kamiya, Y. (1996) Plant J 10:203-213;' 
Dudareva, N., Cseke, L., Blanc, V.M., and Pichersky, E. (1996) Plant Cell 8:1 137- 
1148; Chen. X.-Y., Chen, Y., Heinstein, P., and Davisson, V.J. (1995) Arch 
Biochem. Biophys. 324:255-266; Chen, X.-Y., Wang, M., Chen, Y., Davisson, J., and 
Heinstein. P. (1996) J.Nat. Prod. 59:944-95 1 ; Back, K., and Chappell, J. (1995) 
J. Biol. Chem. 270:7375-738 1 ) reveals two absolutely conserved arginine residues, 
corresponding to Arg 184 and Arg 365 of pinene synthase (SEQ ID NO:4), one highly 
conserved cysteine residue (pinene synthase Cys 543 )(SEQ ID NO:4), and one highly 
conserved histidine residue (pinene synthase His' 80 )(SEQ ID NO:4). The DDXXD 
35 (SEQ ID NO:45) sequence motif (pinene synthase Asp 379 , Asp 380 and Asp 383 ) (SEQ 
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ID NO:4) is absolutely conserved in all relevant plant terpene synthases, as are 

198 248 322 329 

several other amino acid residues corresponding to Phe , Leu , Glu Trp ** , 
Trp 46 ° and Pro 467 of pinene synthase (SEQ ID NO:4). 

Amino acid sequences of the plant terpene synthases were compared with 
5 each other and with the deduced sequences of several sesquiterpene synthases cloned 
from microorganisms (Proctor, R.H., and Hohn, T.M. (1993) J. Biol. 
Chem. 268:4543-4548; Back, K., and ChappelL J. (1995) J. Biol. Chem. 270:7375- 
7381; Hohn, T.M, and Desjardins, A.E. (1992) Mol Plant-Microbe 
Interactions 5:249-256). As with all other plant terpene synthases, no significant 
10 conservation in primary sequence exist? between the monoterpene synthases from 
Grand fir and the terpene synthases of microbial origin, except for the DDXXD (SEQ 
ID NO:45) sequence motif previously identified as a common element of all terpene 
synthases, and prenyltransferases which employ a related electrophilic reaction 
mechanism (Croteau, R., Wheeler, C.J., Cane, D.E., Ebert, R., and Ha, 11. -J. (1987) 
15 Biochemistry 26:5383-5389; Chen, A., Kroon, P. A., and Poulter, D.C. (1994) Protein 
Sci. 3:600-607; McCaskill, D., and Croteau, R. (1997) Adv. Biochem. Engineering 
Biotech. 55:108-146). The evidence is presently insufficient to determine whether 
extant plant and microbial terpene synthases represent divergent evolution from a 
common ancestor, which may also have given rise to the prenyltransferases, or 
20 whether these similar catalysts evolved convergently. 

EXAMPLE 10 

Alteration of Monoterpene Levels and Composition in Plant Seeds 

In accordance with the present invention, methods for increasing production 
of monoterpene compounds in a plant, particularly in plant seeds, are provided. The 
25 methods involve transforming a plant cell with a nucleic acid sequence encoding at 
least one gymnosperm monoterpene synthase, such as those encoded by the nucleic 
acid sequences set forth in SEQ ID NOT, SEQ ID NO:3 and SEQ ID NO:5. This 
has the effect of altering monoterpene biosynthesis, thereby increasing the production 
of monotcrpenes, as well as providing novel seed oils having desirable monoterpene 
30 compositions. In this manner, the transformed seed provides a factory for the 
production of modified oils. The modified oil itself may be used and/or the 
compounds in the oils can be isolated. Thus, the present invention allows for the 
production of particular monoterpenes of interest as well as speciality oils. 

The nucleic acid encoding the monoterpene synthases of the present invention 
35 can be used in expression cassettes for expression in the transformed plant tissues. 
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To alter the monoterpene levels in a plant of interest, the plant is transformed with at 
least one expression cassette comprising a transcriptional initiation region linked to a 
nucleic acid sequence encoding a monoterpene synthase. Such an expression cassette 
is provided with a plurality of restriction sites for insertion of the nucleic acid 
5 sequence encoding a monoterpene synthase so that it is under the transcriptional 
regulation of the regulatory regions. 

The transcriptional initiation sequence may be native or analogous to the host 
or foreign or heterologous to the host. In this regard, the term "foreign" means that 
the transcriptional initiation sequence is not found in the wild-type host into which 
10 the transcriptional initiation region is introduced. 

Of particular interest are those transcriptional initiation regions associated 
with storage proteins, such as napin, cruciferin, P-conglycinin, phaseolin, globulin or 
the like, and proteins involved in fatty acid biosynthesis, such as acyl carrier protein 
(ACT) See. U.S. Patent No. 5,420,034. herein incorporated by reference. 

The transcriptional cassette will preferably include, in the 5' to 3' direction of 
transcription, a transcriptional and translational initiation region, a gymnosperm 
monoterpene synthase DNA sequence of interest, and a transcriptional and 
translational termination region functional in plants. The termination region may be 
from the same organism as the transcriptional initiation region, may be from the 
same organism as the monoterpene synthase DNA, or may be derived from another 
source. Convenient termination regions arc available from the Ti-plasmid of 
A lumcfaciens, such as the octopine synthase and nopaline synthase termination 
regions. Other termination sequences are set forth in Guerineau etal., (1991), Mol. 
Gen. Gene,., 262:141-144; Proudfoot, (1991), Cell, 64:671-674; Sanfacon et a!., 
(1991). Genes Dev., 5:141-149; Mogen etal., (1990), Plant Cell, 2:1261-1272" 
Munroe etal., (1990), Gene, 91:151-15 8; Ballas et al., (1989), Nucleic Acids Res', 
17:7891-7903; Joshi et a]., ( 1 987), Nucleic Acid Res., 15:9627-9639). 

In the presently preferred form of the invention, a nucleic acid sequence 
encoding a gymnosperm monoterpene synthase protein will be targeted to plastids, 
such as chloroplasts, for expression. Thus, the nucleic acid sequence, or sequences] 
encoding a gymnosperm monoterpene synthase protein, or proteins, may be inserted 
into the plastid for expression with appropriate plastid constructs and regulatory 
elements. Alternatively, nuclear transformation may be used in which case the 
expression cassette will contain a nucleic acid sequence encoding a transit peptide to 
direct the monoterpene biosynthesis enzyme of interest to the plastid. Such transit 
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peptides arc known in the art. See. for example. Von Heijne et al. (1991) Plant A/c/. 
Biol. Rep. 9:104-126; Clark et al. (1989) J. Biol. Chem. 264:17544-17550; delia- 
Cioppa etal. (1987.) Plant Physiol. 84:965-968; Romer et al. (1993) Biochem. 
Bwphys. Res. Commun. 196:1414-1421; and, Shah etal. (1986) Science 233:478- 
481. Nucleic acid sequences encoding gymnosperm monoterpene synthases of the 
present invention may utilize native or heterologous transit peptides. 

The construct may also include any other necessary regulators such as plant 
translational consensus sequences (Joshi, CP., (1987), Nucleic Acids Research, 
15:6643-6653). introns (Luehrsen and Walbot, (1991), Mol. Gen. Genet., 225:81-93) 
and the like, operably linked to a nucleotide sequence encoding a monoterpene 
synthase of the present invention. 

It may be beneficial to include 5' leader sequences in the expression cassette 
which can act to enhance translation. Translation leaders are known in the art and 
include: picomavirus leaders, for example, EMCV leader (Encephalomyocarditis 5' 
noncoding region) (Elroy-Stein, O., Fuerst, T.R., and Moss, B. (1989) PNAS USA 
86:6126-6130); poly virus leaders, for example, TEV leader (Tobacco Etch Virus) 
(Allison et al. (1986); MDMV leader (Maize Dwarf Mosaic Virus); Virology, 154:9- 
20). and human immunoglobulin heavy-chain binding protein (BiP), (Macejak D G 
and Sarnow, P. (1991), Nature, 353:90-94; untranslated leader from the coat protein 
mRNA of alfalfa mosaic virus (AMV RNA 4), (Jobling, S.A., and Gehrke, L. (1987), 
Nature, 325:622-625; tobacco mosaic virus leader (TMV), (Gallie, D.R. et al. (1989)' 
Molecular Biology of RNA, pages 237-256; and maize chlorotic mottle virus leader 
(MCMV) (Lommel, S.A. et al. (1991), Virology, 81:382-385. See also. Della-Cioppa 
et al., (1987). Plant Physiology, 84:965-968. 

Depending upon where the monoterpene synthase sequence of interest is to be 
expressed, it may be desirable to synthesize the sequence with plant preferred 
codons, or alternatively with chloroplast preferred codons. The plant preferred 
codons may be determined from the codons of highest frequency in the proteins 
expressed in the largest amount in the particular plant species of interest See EPA 
0359472: EPA 0385962: WO 91/16432; Perlak et al. (1991) Proc. Natl. Acad. Sci. 
USA 88:3324-3328; and Murray et al. (1989) Nucleic Acids Research 17:477-498. In 
tins manner, tiie nucleotide sequences can be optimized for expression in any plant 
It is recognized that all or any part of the nucleic acid sequence encoding a 
gymnosperm monoterpene synthase protein may be optimized or synthetic. That is, 
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synthetic or partially optimized sequences may also be used. For the construction of 
chloroplast preferred genes, see U.S. Patent No. 5,545,817. 

In preparing the transcription cassette, the various DNA fragments may be 
manipulated, so as to provide for the DNA sequences in the proper orientation and in 
the proper reading frame. Towards this end, adapters or linkers may be employed to 
join the DNA fragments or other manipulations may be involved to provide for 
convenient restriction sites, removal of superfluous DNA, removal of restriction 
sites, or the like. For this purpose, in vitro mutagenesis, primer repair, restriction, 
annealing, resection, ligalion, or the like may be employed, where insertions, 
deletions or substitutions, such as transitions and transversions, may be involved. 

The recombinant DNA molecules of the invention can be introduced into the 
plant cell in a number of art-recognized ways. Those skilled in the art will appreciate 
that the choice of method might depend on the type of plant, i.e., monocot or dicot, 
targeted for transformation. Suitable methods of transforming plant cells include 
microinjection (Crossway et al. (1986) BioTechniques 4:320-334), electroporation 
(Riggs etal. (1986) Proc. Nail. Acad. Sci. USA 83:5602-5606), Agrobacterium 
mediated transformation (Hinchee etal. (1988) Biotechnology 6:915-921) and 
ballistic particle acceleration (see, for example, Sanford etal., U.S. Patent 
No. 4,945,050; and McCabe etal. (1988) Biotechnology 6:923-926). Also see, 
Weissinger etal. (1988) Annual Rev. Genet. 22:421-477; Sanford et al. (1987) 
Particulate Science and Technology 5:27-37 (onion); Christou etal. (1988) Plant 
Physiol. 87:671-674 (soybean); McCabe etal. (1988) Bio/Technology 6:923-926 
(soybean); Datta etal. (1990) Biotechnology 8:736-740 (rice); Klein et al (1988) 
Proc. Nail. Acad Sci. USA, 85:4305-4309 (maize); Klein etal. (1988) Biotechnology 
6:559-563 (maize); Klein etal. (1988) Plant Physiol. 91:440-444 (maize); Fronun 
etal. (1990) Biotechnology 8:833-839; and Gordon-Kamm etal. (1990) Plant Cell 
2:603-618 (maize). 

Alternatively, a plant plastid can be transformed directly. Stable 
transformation of chloroplasls has been reported in higher plants, see, for example, 
SVAB etal. (1990) Proc. Natl Acad. Sci. USA 87:85268530; SVAB & Maliga 
(1993) Proc. Natl. Acad Sci. USA 90:913-917; Staub & Maliga (1993) Embo 
.1. 12:601-606. The method relics on particle gun delivery of DNA containing a 
selectable marker and targeting of the DNA to the plastid genome through 
homologous recombination. In such methods, plastid gene expression can be 
accomplished by use of a plastid gene promoter or by trans-activation of a silent 
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plastid-borne transgene positioned for expression from a selective promoter sequence 
such as that recognized by T7 RNA polymerase. The silent plastid gene is activated 
by expression of the specific RNA polymerase from a nuclear expression construct 
and targeting of the polymerase to the plastid by use of a transit peptide. Tissue- 
5 specific expression may be obtained in such a method by use of a nuclear-encoded 
and plastid-directed specific RNA polymerase expressed from a suitable plant tissue 
specific promoter. Such a system has been reported in McBride et al. (1994) Proc. 
Nail. AcadSci. USA 91:7301-7305. 

The cells which have been transformed may be grown into plants by a variety 
10 of art-recognized means. See, for example, McConnick et al., Plant Cell Reports 
( 1MX()), 5:81-84. These plants may then be grown, and either selfed or crossed with a 
dit'lcrcnt plant strain, and the resulting homozygotes or hybrids having the desired 
plicnniypic characteristic identified. Two or more generations may be grown to 
ensure that the subject phenotypic characteristic is stably maintained and inherited 
1 5 and then seeds harvested to ensure the desired phenotype or other property has been 
achieved. 

As a host cell, any plant variety may be employed. Of particular interest, are 
plani species which provide seeds of commercial value. For the most part, plants 
will be chosen where the seed is produced in high amounts, a seed-specific product 

20 of interest is involved, or the seed or a seed part is edible. Seeds of interest in the 
practice of the present invention include, but are not limited to, the oil seeds, such as 
oilseed Brassica seeds, cotton seeds, soybean, safflower, sunflower, coconut, palm, 
and the like; grain seeds such as wheat, barley, oats, amaranth, flax, rye, triticale, rice 
and corn; other edible seeds or seeds with edible parts including pumpkin, squash, 

25 sesame, poppy, grape, mung beans, peanut, peas, beans, radish, alfalfa, cocoa, and 
coffee; and tree nuts such as walnuts, almonds, pecans, and chick-peas. 

EXAMPLE 11 

A Strategy For Cloning Cymnosnerm Monoterpenc Synthases 

The present invention includes gymnosperm monoterpene synthase proteins, 
30 and nucleic acid molecules that encode gymnosperm monoterpene synthase proteins. 
The amino acid sequence of each of the gymnosperm monoterpene synthase proteins 
of the present invention each includes at least one of the amino acid sequence 
elements disclosed in Table 1 . 



Table 1 



Amino Acids 


Sequence 


Orientation 


Comments 



4S 
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1. 70-77 


H S N (L. L V) WDDD 
<SEQIDNO:46) 


F only 


HS makes a poor reverse 
primer 


2. 148-153 


A L D Y V Y 

J. X X / 1 r J. V J. 

(SEQ ID NO: 47) 


r iiiiu j\ 




3. 306-312 


EI A K I F F 

(SEQ ID NO: 48) 


J dnu 1\ 




4. 328-333 


R W W K F9 

(SEQ ID NO: 49) 




r primer uses 1 nt of Ser 
codon; R uses 1 51 two nts 
or Arg codon (rare onlv) 


5. 377-383 


(V, I, DLDDMYD 

(SEQ ID NO: 50) 


F and R 




6. 377-383 


(V, I.LlLDDLYD 

(SEQ ID NO: 51) 


F and R 


Degeneracy of M/L at 381 
in*-;!! lKji biiiizic pniner 


7. 377-383 


( V, I, L) L DDI YD 

(SEQ ID NO: 52) 


F and R 




8. 543-549 


CYMKD (N, H) P 

(SEQ ID NO: 53) 


R 


F primer can also be 
constructed with this 
peptide but is too close to 
the 3 f end to be useful 



The numbers set forth in Table 1 for the first and last amino acid residue of 
each of the peptide sequences is the number of the corresponding amino acid residue 
in the amino acid sequence of the (-)-pinene synthase (SEQ ID NO:4) isolated from 
Abies grandis. Where a sequence of amino acid residues appears in brackets, e.g., 
(L,I,V) in Table 1. the first amino acid residue within the brackets is the residue that 
appears in the (-)-pinene synthase amino acid sequence set forth in SEQ ID NO:4. 
The subsequent amino acid residues within the brackets represent other ammo acid 
residues that commonly occur at the corresponding position in the amino acid 
sequence of other Abies grandis enzymes involved in terpene synthesis. 

In Table 1, the letter "F" refers to the forward PGR reaction, i.e., the PCR 
reaction which synthesizes the sense nucleic acid strand that encodes a gymnosperm 
monoterpene synthase. The letter "R" refers to the reverse PCR reaction, i.e., the 
PCR reaction that synthesizes the antisense nucleic acid molecule that is 
complementary to the sense nucleic acid strand synthesized in the forward PCR 
reaction. 
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In order to clone nucleic acid molecules encoding gymnosperm monoterpene 
synthases of the present invention, one or more oligonucleotide molecules 
corresponding to at least a portion of one of the amino acid sequences set forth in 
Table 1 can be used as a probe or probes with which to screen a genomic or cDNA 
library derived from one or more gymnosperm species. In this context, the term 
"corresponding," or "correspond" or "corresponds," means that the oligonucleotide 
base sequence either a) encodes all or part of at least one of the amino acid sequences 
set iorth in Table 1, or b) is complementary to a base sequence that encodes all or 
part ol at least one of the amino acid sequences set forth in Table 1. The 
oligonucleotide probe(s) may contain a synthetic base, such as inosine, which can be 
substituted for one or more of the four, naturally-occurring bases, i.e.. adenine ("A"), 
guanine ("G"), cytosine ("C") and thymine ("T"). Thus, for example, the following 
oligonucleotide sequences "correspond" to the tripeptide sequence M M M: 
5'ATGATGATG3' (sense orientation) (SEQ ID NO:54); 3 TACT ACT AC 5* 
1 5 ( antisense orientation) (SEQ ID NO:55) and 3'IACI ACIAC5' (SEQ ID NO:56). 

One or more oligonucleotide sequence(s), corresponding to at least a portion 
of at least one of the amino acid sequences set forth in Table 1, can be used to screen 
a nucleic acid library in order to identify monoterpene synthase clones of the present 
invention, according to methods well known to one of ordinary skill in the art. See, 
20 e.g., Samhrook et al, supra. The stringency of the hybridization and wash conditions 
during library screening in accordance with the present invention, utilizing one or 
more oligonucleotide sequence(s) corresponding to at least a portion of at least one of 
the amino acid sequences set forth in Table 1, is at least: for the hybridization step, 
6X SSPE, 40-45°C, for 36 hours; for the wash step, 3X SSPE, 45°C, 3X15 minute 
25 washes. The presently preferred hybridization and wash conditions during library 
screening, utilizing one or more oligonucleotide sequence(s) corresponding to at least 
a portion of at least one of the amino acid sequences set forth in Table 1, in 
accordance with the present invention are: for the hybridization step, 6X SSPE, 40- 
45°C, for 36 hours; for the wash step, 0.1X SSPE, 65°C-70°C, 3X15 minute washes. 
30 Examples of oligonucleotide sequences, corresponding to at least one of the 

amino acid sequences set forth in Table 1, that hybridize, under the foregoing 
hybridization and wash conditions, to the sense strands of the nucleic acid sequences 
of the present invention that encode gymnosperm monoterpene synthase proteins are 
set forth in Table 2. 
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Table 2 



Nucleic Acid Sequence 


Corresponds to: 


GTG TCG TTG GAG ACC CTG CTG CTG 
(SEQ ID NO:57) 


SEC) ID No 46 


CGG GAG CTG ATG CAG ATG (SEQ ID 
NO:58) 


SEC) ID No 47 


CTC GAG CGG TTC GAG CTC AAG 
(SEQ IDNO:59) 


SEQ ID No. 48 


GCC ACC ACC TTC CTC TCG (SEQ ID 
NO:60) 


SEQ ID No. 49 


GAG GAG CTG CTG TAC ATG CTG 
(SEQ ID NO:61 ) 


SEQ ID No. 50 


GAG GAG CTG CTG GAG ATG CTG 
(SEQ IDNO:62) 


SEQ ID No. 51 



Similarly, each of the gymnosperm monoterpene synthase clones set forth in 
SEQ ID NOT, SEQ ID NO:3 and SEQ ID NO:5, or a portion thereof, may be used as 



a probe to screen a nucleic acid library in order to isolate monoterpene synthase 
5 clones of the present invention, according to methods well known to one of ordinary 
skill in the art. See, e.g., Sambrook et a!, supra. The stringency of the hybridization 
and wash conditions during library screening in accordance with the present 
invention is at least: for the hybridization step, 6X SSPE buffer at 45°C to 50°C for 
36 hours; for the wash step, 3X SSPE buffer at 50°C (three, fifteen minute washes). 

10 In accordance with the present invention, the presently preferred hybridization and 
wash conditions during library screening utilizing any of the gymnosperm 
monoterpene synthase clones set forth in SEQ ID NO:l, SEQ ID NO:3 and SEQ ID 
NO:5, or a portion thereof, as probe are: for the hybridization step, 6X SSPE, 40- 
45°C, for 36 hours; for the wash step. 0.1X SSPE, 70°C-75°C, 3X15 minute washes. 

15 Additionally, at least two oligonucleotide sequence(s), each corresponding to 

at least a portion of at least one of the amino acid sequences set forth in Table 1, can 
be used in a PCR reaction to generate a portion of a monoterpene synthase clone of 
the present invention, which can be used as a probe to isolate a full-length clone of a 
monoterpene synthase clone of the present invention. Thus, oligonucleotides that are 

20 useful as probes in the forward PCR reaction correspond to at least a portion of at 
least one of the amino acid sequences disclosed in Table 1 as having the "F M 
orientation. Conversely, oligonucleotides that are useful as probes in the reverse 
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PCR reaction correspond to at least a portion of at least one of the amino acid 
sequences disclosed in Table 1 as having the "R" orientation. PCR reactions can be 
carried out according to an-recognized PCR reaction conditions, such as the PCR 
reaction conditions set forth in Example 1 herein and as set forth in n PCR Strategies", 
5 M.A. Innis, D.H. Gelfand and J.J. Sninsky. eds.. 1995. Academic Press. San Diego, 
CA (Chapter 14); "PCR Protocols: A Guide to Methods and Applications". M.A. 
Innis, D.H. Gelfand, J.J. Sninsky and T.J. White, eds.. Academic Press, NY (1990). 
The presently preferred PCR reaction conditions are: 

dNTPs 200 jaM each 

MgCl 2 5-7 mM 

F and R primers 100 nM - 1 uM each 

Taq polymerase 1-2 units/reaction 

cDN A template 10-100 ng/reaction 

Buffers, PCR grade water, and Chill-out wax or mineral oil 



10 



The presently preferred thermocycler conditions are: 

Denaturation 94° x 2 min 1 cycle 

Denaturation 94° x 45 s 35 cycles 

Annealing 42° - 55° x 45 s - 1 min 

Polymerization 72° x 1-2 min 

Adenylation 72° x 10 min 1 cycle 

While the preferred embodiment of the invention has been illustrated and 
15 described, it will be appreciated that various changes can be made therein without 
departing from the spirit and scope of the invention. 
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The embodiments of the invention in which an exclusive property or privilege 
is claimed are defined as follows: 

1- An isolated nucleotide sequence encoding a gymnosperm 
monoterpene synthase, said isolated nucleotide sequence being capable of 
hybridizing under stringent conditions to at least one oligonucleotide having a base 
sequence that corresponds to at least one peptide sequence selected from the group 
consisting of SEQ ID NO:46. SEQ ID NO:47. SEQ ID NO:48, SEQ ID NO;49. SEQ 
ID NO:50. SEQ ID NO:5L SEQ ID NO:52 and SEQ ID NO:53. 

2. An isolated nucleotide sequence of Claim 1 encoding a gymnosperm 
myrcene synthase. 

3. An isolated nucleotide sequence of Claim 1 encoding a gymnosperm 
limonene synthase. 

4. An isolated nucleotide sequence of Claim 1 encoding a gymnosperm 
pinene synthase. 

5. An isolated nucleotide sequence of Claim 1 encoding a Grand fir 
(Ab ies grandis) monoterpene synthase. 

6. An isolated nucleotide sequence of Claim 1 encoding a Grand fir 
{Abies grandis) myrcene synthase. 

7. An isolated nucleotide sequence of Claim 1 encoding a Grand fir 
(Abies grandis) limonene synthase. 

8. An isolated nucleotide sequence of Claim 1 encoding a Grand fir 
(Abies grandis) pinene synthase. 

9. An isolated nucleotide sequence of Claim 1 having the sequence of 
SEQ ID NO:l. 

10. An isolated nucleotide sequence of Claim 1 having the sequence of 
SEQ IDNO:3. 
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11. An isolated nucleotide sequence of Claim 1 having the sequence of 
SEQ ID NO:5 

12. An isolated nucleotide sequence of Claim 1 which encodes the amino 
acid sequence of SEQ ID NO:2. 

13. An isolated nucleotide sequence of Claim 1 which encodes the amino 
acid sequence of SEQ ID NO:4 

14. An isolated nucleotide sequence of Claim 1 which encodes the amino 
acid sequence of SEQ ID NO:6. 

15. An isolated nucleotide sequence encoding a cymnosperm 
monoterpene synthase protein, said isolated nucleotide sequence havins a 
complementary nucleotide sequence that is capable of hybridizing to the nucleotide 
sequence of any one of SEQ ID NO:L SEQ ID NO:3 and SEQ ID NO:5. 

16. An isolated nucleic acid molecule encoding a gymnosperm 
monoterpene synthase, said nucleic acid molecule being capable of hybridizing to at 
least two oligonucleotides, each of said oligonucleotides having a base sequence that 
corresponds to at least one peptide sequence selected from the group consisting of 
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, 
SEQ ID NO:5L SEQ ID NO:52 and SEQ ID NO:53. 

17. An isolated nucleic acid molecule encoding a Grand fir (Abies 
grandis) monoterpene synthase, said nucleic acid molecule being capable of 
hybridizing to at least two oligonucleotides, each of said oligonucleotides having a 
base sequence that corresponds to at least one peptide sequence selected from the 
group consisting of SEQ ID NO:46, SEQ ID NO:47. SEQ ID NO:48, SEQ ID 
NO:49. SEQ ID NO:50, SEQ ID NO:5K SEQ ID NO:52 and SEQ ID NO:53. 

18. An isolated gymnosperm monoterpene synthase protein, said 
gymnosperm monoterpene synthase protein including at least one amino acid 
sequence element selected from the group consisting of SEQ ID NO:46, SEQ ID 
NO:47, SEQ ID NO:48, SEQ ID NO:49. SEQ ID NO:50. SEQ ID NO:51, SEQ ID 
NO:52 and SEQ ID NO:53. 



WO 99/02030 



PCT/US98/14528 



19. An isolated gymnosperrn myrcene synthase protein, said gymnospenn 
myrcene synthase protein including at least one amino acid sequence element 
selected from the group consisting of SEQ ID NO:46 ? SEQ ID NO:47, SEQ ID 
NO:48. SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51. SEQ ID NO:52 and SEQ 
ID NO:53. 

20. An isolated gymnosperrn pinene synthase protein, said gymnosperrn 
pinene synthase protein including at least one amino acid sequence element selected 
from the group consisting of SEQ ID NO:46. SEQ ID NO:47. SEQ ID NO:48. SEQ 
ID NO:49, SEQ ID NO:50, SEQ ID NO:5 1 , SEQ ID NO:52 and SEQ ID NO:53. 

21. An isolated gymnosperrn limonene synthase protein, said 
gymnosperrn limonene synthase protein including at least one amino acid sequence 
element selected from the group consisting of SEQ ID NO;46, SEQ ID NO:47, SEQ 
ID NO:48, SEQ ID NO;49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52 and 
SEQ ID NO:53. 

22. An isolated Grand fir (Abies grand is) monoterpene synthase protein, 
said Grand fir (Abies grandis) monoterpene synthase protein including at least one 
amino acid sequence element selected from the group consisting of SEQ ID NO:46, 
SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, 
SEQ ID NO:52 and SEQ ID NO:53. 

23. An isolated Grand fir (Abies grandis) monoterpene synthase protein of 
Claim 22, said monoterpene synthase protein having the sequence of SEQ ID NO:2. 

24. An isolated Grand fir (Abies grandis) monoterpene synthase protein of 
Claim 22, said monoterpene synthase protein having the sequence of SEQ ID NO:4. 

25. An isolated Grand fir (Abies grandis) monoterpene synthase protein of 
Claim 22, said monoterpene synthase protein having the sequence of SEQ ID NO:6. 

26. A replicable expression vector comprising a nucleotide sequence 
encoding a gymnosperrn monoterpene synthase, said nucleotide sequence encoding a 
gymnosperrn monoterpene synthase being capable of hybridizing at high stringency 
to at least one oligonucleotide having a base sequence that corresponds to at least one 
peptide sequence selected from the group consisting of SEQ ID NO:46, SEQ ID 
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NO:47. SEQ ID NO:4S. SEQ ID NO:49. SEQ ID NO.50. SEQ ID NO:51. SEQ ID 
NO:52and SEQ IDNO:53. 

ni A replicable expression vector of Claim 26 wherein sa.d nucleotide 
sequence encoding a gymnosperm monoterpene synthase encodes a Grand fir {Abies 
grandis) monoterpene synthase. 

28. A host cell comprising a vector of any one of claims 26 and 27. 

->9 A replicable expression vector comprising a nucleotide sequence 
encodino a BV mnosperm monoterpene synthase, said nucleotide sequence having a 
complementary nucleotide sequence that is capable of hybridizing to the nucleot.de 
sequence of any one of SEQ ID NO: 1. SEQ ID NO:3 and SEQ ID NO:5. 

30. A host cell comprising a vector of Claim 29. 

31. A method of enhancing the production of a gymnosperm monoterpene 
synthase in a suitable host cell comprising introducing into the host cell an 
expression vector of Claim 26 under conditions enabling expression of the 
gymnosperm monoterpene synthase in the host cell. 

3-7 A method for enhancing monoterpene levels in a host cell, said 
method "comprising transforming said host cell with a nucleic acid construct 
comprising at least one nucleic acid sequence defining a transcriptional initiation 
region from a eene expressed in the host cell and at least one nucleic acid sequence 
encoding a evmnosperm monoterpene synthase, said nucleic acid sequence encoding 
a eymnosperm monoterpene synthase being capable of hybridizing at high stringency 
to at least one oligonucleotide having a base sequence that corresponds to at least one 
peptide sequence selected from the group consisting of SEQ ID NO:46, SEQ ID 
NO:47, SEQ ID NO:48, SEQ ID NO:49. SEQ ID NO.50, SEQ ID NO:51, SEQ ID 
NO:52 and SEQ IDNO:53. 

33 The method of Claim 32 wherein said nucleic acid sequence encoding 
a gymnosperm monoterpene synthase encodes a Grand fir {Abies grandis) 
monoterpene synthase. 

34 A method for altering monoterpene levels in a seed from a host plant, 
said method comprising transforming said host plant with a nucleic acid construct 
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comprising at least one nucleic acid sequence defining a transcriptional initiation 
region from a gene expressed in a plant seed and at least one nucleic acid sequence 
encoding a gymnosperm monoterpene synthase, said nucleic acid sequence encoding 
a gymnosperm monoterpene synthase being capable of hybridizing at high stringency 
to at least one oligonucleotide having a base sequence that corresponds to at least one 
peptide sequence selected from the group consisting of SEQ ID NO:46, SEQ ID 
NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID 
NO:52and SEQIDNO:53. 

35. The method of Claim 34 wherein the transcriptional initiation region 
is from a gene preferentially expressed in a plant seed. 

3o. The method of Claim 34 wherein said nucleic acid sequence encoding 
a gvmnosperm monoterpene synthase encodes a Grand fir (Abies xrandis) 
monoterpene synthase. 

37. The method of Claim 34 wherein said gymnosperm monoterpene 
synthase is not native to said host plant. 

38. A transgenic plant produced according to the method of Claim 34. 

39. A method for increasing the monoterpene biosynthetic flux in seed 
from a host plant, said method comprising transforming said host plant with a 
construct comprising as operably linked components, a transcriptional initiation 
region from a gene preferentially expressed in a plant seed, a plastid transit peptide, a 
DNA sequence encoding a gymnosperm monoterpene synthase protein, and a 
transcriptional termination region, said DNA sequence encoding a gymnosperm 
monoterpene synthase protein being capable of hybridizing at high stringency to at 
least one oligonucleotide having a base sequence that corresponds to at least one 
peptide sequence selected from the group consisting of SEQ ID NO:46, SEQ ID 
NO:47. SEQ ID NO:48, SEQ ID NO:49. SEQ ID NO:50, SEQ ID NO:51, SEQ ID 
NO:52 and SEQ ID NO:53. 

40. The method of Claim 39, wherein said DNA sequence encoding a 
gymnosperm monoterpene synthase protein encodes an enzyme selected from the 
group consisting of limonene synthase, myrcene synthase and pinene synthase. 
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41. A method tor producing a monoierpene compound in a seed, said 
method comprising the steps of obtaining a transformed plant which produces said 
seed, said plant having and expressing in its genome: 

a nucleic acid sequence encoding a gymnosperm monoterpene synthase 
protein which is operably linked to a plastid transit peptide and a transcriptional 
initiation region from a gene preferentially expressed in a plant seed, wherein said 
nucleic acid sequence encoding a gymnosperm monoterpene synthase protein is 
capable of hybridizing at high stringency to at least one oligonucleotide having a 
base sequence that corresponds to at least one peptide sequence selected from the 
group consisting of SEQ ID NO:46. SEQ ID NO:47, SEQ ID NO:48. SEQ ID 
NO:49. SEQ ID NO:50. SEQ IDNO:51, SEQ ID NO:52 and SEQ ID NO:53. 

42. A seed transformed with a nucleic acid sequence encoding a 
gymnosperm monoterpene synthase protein, said nucleic acid sequence encoding a 
gymnosperm monoterpene synthase protein being capable of hybridizing at high 
stringency to at least one oligonucleotide having a base sequence that corresponds to 
at least one peptide sequence selected from the group consisting of SEQ ID NO:SEQ 
ID NO:46. SEQ ID NO:47. SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ 
ID NO:51, SEQ ID NO:52 and SEQ ID NO:53. said transformed seed having altered 
monoterpene levels. 

43. The transformed seed of claim 42. wherein said seed produces 
increased levels of at least one monoterpene compound, said monoterpene compound 
selected from the group consisting of (-)-4S-iimonene, (-)-a-pinene, myrcene, 4S-P- 
phellandrene. 

44. The transformed seed of claim 43, wherein said seed produces 
increased levels of (-)4S-limonene. 

45. Seed produced by the method of any one of claims 34, 35, 36, 37, 39 
40 and 41. 

46. Plants produced by the method of any one of claims 34, 35, 36, 37, 
39, 40 and 41. 
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47. Oil extracted from seeds produced by the method of any one of claims 
34.35,36.37.39,40 and 41. 

48. Meal extracted from seed produced by the method of any one of 
claims 34, 35, 36, 37, 39, 40 and 41 . 

49. The method of any one of claims 34, 35, 36, 37, 39, 40 and 41, 
wherein said seed is from a plant selected from the group consisting of oilseed 
Brassica. cotton, soybean, safflower, sunflower, coconut, palm, wheat, barley, rice, 
corn, oats, amaranth, pumpkin, squash, sesame, poppy, grape, mung beans, peanut, 
peas, beans, radish, alfalfa, cocoa, coffee, and tree nuts. 

50. The method of Claim 49, wherein said seed is from an oilseed crop 
plant selected from the group consisting of oilseed brassica, cotton, soybean, 
safflower, sunflower, palm, coconut, and corn. 
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SEQUENCE LISTING 

<110> Croteau, Rodney B 
Bohlmann, Carl J 
Steele, Christopher L 

<120> Monoterpene Synthases from Grand Fir (Abies grandis) 
<130> WSUR112537 

<140> 
<141> 

<150> 60/052,249 
<151> 1997-07-11 

<160> 62 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2196 
<212> DNA 

<213> Abies grandis 

<220> 

<221> CDS 

<222> (69) . . (1952) 

<223> Clone AG2 . 2 encoding myrcene synthase 
<400> 1 

tgccggcacg aggttatctt gagcttcctc catataggcc aacacatatc atatcaaagg 60 

gagcaaga atg get ctg gtt tct ate tea ccg ttg get teg aaa tct tge 110 
Met Ala Leu Val Ser He Ser Pro Leu Ala Ser Lys Ser Cys 
1 5 io 



ctg cgc aag teg ttg ate agt tea att cat gaa cat aag cct ccc tat 
Leu Arg Lys Ser Leu He Ser Ser He His Glu His Lys Pro Pro Tvr 
15 20 25 30 



158 



a™ th 3 Ctt atg Cgt agg Cga aaa tct gtc acg 206 

Arg Thr He Pro Asn Leu Gly Met Arg Arg Arg Gly Lys Ser Val Thr 

35 40 45 

cct tec atg age ate agt ttg gee ace get gca cct gat gat ggt gta 254 

Pro Ser Met Ser He Ser Leu Ala Thr Ala Ala Pro Asp Asp Gly Val 

50 55 60 



caa aga cgc ata ggt gac tac cat tec aat ate tgg gac gat gat ttc 
Gin Arg Arg He Gly Asp Tyr His Ser Asn He Trp Asp Asp Asp Phe 



302 



65 -70 



75 



ata cag tct eta tea acg cct tat ggg gaa ccc tct tac cag gaa cgt 

He Gin Ser Leu Ser Thr Pro Tyr Gly Glu Pro Ser Tyr Gin Glu Arg 

80 85 90 

get gag aga tta att gtg gag gta aag aag ata ttc aat tea atg tac 

Ala Glu Arg Leu He Val Glu Val Lys Lys He Phe Asn Ser Met Tyr 

95 100 105 no 



350 



398 



1 
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ctg gat gat gga aga tta atg agt tec ttt aat gat etc atg caa cqc 
Leu Asp Asp Gly Arg Leu Met Ser Ser Phe Asn Asp Leu Met Gin Arg 

115 120 125 



aag aac gag ata aca tea get ctg gat tat gtt ttc cgt tac tgq qaq 
Lys Asn Glu He Thr Ser Ala Leu Asp Tyr Val Phe Arg Tyr Tip Glu 
145 150 155 



gaa aac ggc att gga tgt ggg aga gac agt att gtt act gat etc aac 
Glu Asn Gly He Gly Cys Gly Arg Asp Ser He Val Thr Asp Leu Asn 
160 165 170 



tt9 " g ° ga aCt Ctt cga tta cac tac act gta 

Ser Thr Ala Leu Gly Phe Arg Thr Leu Arg Leu His Gly Tyr Thr Val 

190 



175 180 185 



tct cca gag gtt tta aaa get ttt caa gat caa aat gga cag ttt gta 

Ser Pro Glu Val Leu Lys Ala Phe Gin Asp Gin Asn Gly Gin Phe Val 

195 200 205 

tgc tec ccc ggt cag aca gag ggt gag ate aga age gtt ctt aac tta 

Cys Ser Pro Gly Gin Thr Glu Gly Glu He Arg Ser Val Leu Asn Leu 



210 215 

tat egg get tec etc att gee ttc cet ggt gag aaa gtt atg gaa gaa 

Tyr Arg Ala Ser Leu He Ala Phe Pro Gly Glu Lys Val Met Glu Glu 
225 230 235 

get gaa ate ttc tec aca aga tat ttg aaa gaa get eta caa aag att 

Ala Glu He Phe Ser Thr Arg Tyr Leu Lys Glu Ala Leu Gin Lys He 

240 245 250 

cca gtc tec get ctt tea caa gag ata aag ttt gtt atg gaa tat ggc 

Pro Val Ser Ala Leu Ser Gin Glu He Lys Phe Val Met Glu Tyr Glv 

255 260 265 270 

tgg cac aca aat ttg cca aga ttg gaa gca aga aat tac ata gac aca 

Trp His Thr Asn Leu Pro Arg Leu Glu Ala Arg Asn Tyr He Asp Thr 

275 280 285 

ctt gag aaa gac ace agt gca tgg etc aat aaa aat get ggg aag aag 

Leu Glu Lys Asp Thr Ser Ala Trp Leu Asn Lys Asn Ala Gly Lvs Lvs 

290 295 300 

ctt tta gaa ctt gca aaa ttg gag ttc aat ata ttt aac tec tta caa 

Leu Leu Glu Leu Ala Lys Leu Glu Phe Asn He Phe Asn Ser Leu Gin 
305 310 3i 5 

caa aag gaa tta caa tat ctt ttg aga tgg tgg aaa gag teg gat ttg 

Gin Lys Glu Leu Gin Tyr Leu Leu Arg Trp Trp Lys Glu Ser Asp Leu 

320 325 330 

cct aaa ttg aca ttt get egg cat cgt cat gtg gaa ttc tac act ttq 

Pro Lys Leu Thr Phe Ala Arg His Arg His Val Glu Phe Tyr Thr Leu 

335 340 345 350 



446 



ctt tgg ata gtc gat age gtt gaa cgt ttg ggg ata get aga cat ttc 494 
Leu Trp He Val Asp Ser Val Glu Arg Leu Gly He Ala Arg His Phe 

130 135 140 



542 



590 



638 



686 



734 



782 



830 



878 



926 



974 



1022 



1070 



1118 
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gcc tct tgt att gcc att gac cca aaa cat tct gca ttc aaa eta enr m 
Ala Ser Cys He Ala He Asp P ro Lys His Ser Ala Phe Arg Leu III 

355 360 365 

ttc gcc aaa atg tgt cat ctt gtc aca gtt ttg gac gat att tac qac 1214 
Phe Ala Lys Met Cys His Leu Val Thr Val Leu Asp L P H e Tyr Asp 

370 375 380 

act ttt gga acg att gac gag ctt gaa etc ttc aca tct aca att- i 
Thr Phe Gly Thr He Asp Glu Leu Glu Leu Phe Thr S er Ala" ill III 
385 3g5 y 

aga tgg aat tea tea gag ata gaa cac ctt cca gaa tat ata aaa tat mn 
9 loo Ht G1U H1S LeU Pr ° G1 " ^ ^ Cys" 

gtg tac atg gtc gtg ttt gaa act gta aat gaa ctg aca cqa aaa aca i«» 
Val Tyr Met val Val Phe Glu Thr Val Asn Glu Leu Thr Arg 111 £J ^ 

420 425 

gag aag act caa ggg aga aac act etc aac tat gtt cga aag act taa 1406 
Glu Lys Thr Gin Gly Arg Asn Thr Leu Asn Tyr Val Arg Lys Ala Trp 

435 440 445 

Glu HI HI Pne 933 933 9Ca 333 tg * atc tct 1454 

Glu Ala Tyr Phe Asp Ser Tyr Met Glu Glu Ala Lys Trp lie Ser Asn 

450 455 460 

r 9t f tg ° Ca 3tg ttfc 933 9a 9 tac «t gag aat ggg aaa gtq aqc 1502 

Gly Tyr Leu Pro Met Phe Glu Glu Tyr His Glu Asn Gly Lys S s 2 

465 470 475 

tct gca tat cgc gta gca aca ttg caa ccc ate etc act tta aat ac* 1 
Ser Ala Tyr Arg Val Ala Thr Leu Gin Pro He Leu Thr ^ f£ 1550 

III fr t Ct ? at atC tt9 339 gga att ^ at ttfc c « tec agg ttc 1598 

Trp Leu Pro Asp Tyr He Leu Lys Gly He Asp Phe Pro Ser Arg Phe 

5 500 505 510 

aat gat ttg gca teg tec ttc ctt egg eta cga ggt gac aca cac tac 164fi 
Asn Asp Leu Ala Ser Ser Phe Leu Arg Leu Arg G? y Asp Thr Arg Cys 

515 520 525 

Hr 111 l,Z ^ 359 gat Cgt ggt 933 933 gCt tCg t 9 fc ata tea tgt 1694 
Tyr Lys Ala Asp Arg Asp Arg Gly Glu Glu Ala Ser Cys He Ser Cvs 

530 535 54Q 

£vr Mf»t I 30 CCt 993 tCa 3CC gaa gaa 9 at acc ct c aat cat 1742 

Tyr Met Lys Asp Asn Pro Gly Ser Thr Glu Glu Asp Ala Leu Asn His 
545 55 0 555 

ne Asn A^a Met V^ ^ "* aat ^ ^a ctt 1790 

lie Asn Ala Met Val Asn Asp He He Lys Glu Leu Asn Trp Glu Leu 
560 565 57Q 

Leu Aro r C att CCa atg Ctg gCC aa ^ aaa cat 9^ ttt 1838 

Leu Arg Ser Asn Asp Asn He Pro Met Leu Ala Lys Lys His Ala Phe 

75 580 585 5 90 
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1934 



gac ata aca aga get etc cac cat etc tac ata tat cqa oat aac ttr iBRf 
Asp lie Thr Arg Ala Leu His His Leu Tyr lie Tyr Arg Asp Gly Phe 

595 600 605 

S^r ^ C r C gtt 9" Ctc Ctt 

Ser Val Ala Asn Lys Glu Thr Lys Lys Leu Val Met Glu Thr Leu Leu 

610 615 6 20 

gaa tct atg ctt ttt taa ctataaccat atccataata ataagctcat 198 o 

Glu Ser Met Leu Phe -i^o- 
625 

aatgctaaat tattggcett atgacatagt ttatgtatgt acttgtgtga attcaatcat 2042 

atcgtgtggg tatgattaaa aagctagage ttactaggtt agtaacatgg tgataaaagt 2102 

tataaaatgt gagttataga gatacecatg ttgaataatg aattacaaaa agagaaattt 2162 

atgtagaata agattggaag cttttcaatt gttt 



196 



<210> 2 
<211> 627 
<212> PRT 

<213> Abies grandis 
<400> 2 

Met Ala Leu Val Ser He Ser Pro Leu Ala Ser Lys Ser Cys Leu Arg 

5 10 15 

Lys Ser Leu lie Ser Ser lie His Glu His Lys Pro Pro Tyr Arg Thr 

20 25 30 

He Pro Asn Leu Gly Met Arg Arg Arg Gly Lys Ser Val Thr Pro Ser 
35 40 45 

Met Ser He Ser Leu Ala Thr Ala Ala Pro Asp Asp Gly Val Gin Arg 
50 55 eo 9 

Arg lie Gly Asp Tyr His Ser Asn He Trp Asp Asp Asp Phe He Gin 
65 70 75 80 

Ser Leu Ser Thr Pro Tyr Gly Glu Pro Ser Tyr Gin Glu Arg Ala Glu 

85 90 95 

Arg Leu He Val Glu Val Lys Lys He Phe Asn Ser Met Tyr Leu Asp 

100 105 110 

Asp Gly Arg Leu Met Ser Ser Phe Asn Asp Leu Met Gin Arg Leu Trp 
115 120 125 

He Val Asp Ser Val Glu Arg Leu Gly He Ala Arg His Phe Lys Asn 
130 135 140 

Glu He Thr Ser Ala Leu Asp Tyr Val Phe Arg Tyr Trp Glu Glu Asn 

14 5 ISO Ttt 

155 160 

Gly He Gly Cys Gly Arg Asp Ser He Val Thr Asp Leu Asn Ser Thr 

165 170 175 
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Ala Leu Gly Phe Arg Thr Leu Arg Leu His Gly Tyr Thr Val Ser Pro 

180 185 190 

Glu Val Leu Lys Ala Phe Gin Asp Gin Asn Gly Gin Phe Val Cys Ser 
195 200 205 

Pro Gly Gin Thr Glu Gly Glu He Arg Ser Val Leu Asn Leu Tyr Arq 

210 215 220 

Ala Ser Leu He Ala Phe Pro Gly Glu Lys Val Met Glu Glu Ala Glu 
225 230 235 2 4o 

He Phe Ser Thr Arg Tyr Leu Lys Glu Ala Leu Gin Lys lie Pro Val 

245 250 255 

Ser Ala Leu Ser Gin Glu He Lys Phe Val Met Glu Tyr Gly Trp His 

260 265 270 



Thr Asn Leu Pro Arg Leu Glu Ala Arg Asn Tyr He Asp Thr 



7 $ 280 



Leu Glu 



285 



Lys Asp Thr Ser Ala Trp Leu Asn Lys Asn Ala Gly Lys Lys Leu Leu 
290 295 300 

Glu Leu Ala Lys Leu Glu Phe Asn He Phe Asn Ser Leu Gin Gin Lys 
30S 310 315 3 2 o 

Glu Leu Gin Tyr Leu Leu Arg Trp Trp Lys Glu Ser Asp Leu Pro Lys 

325 330 335 

Leu Thr Phe Ala Arg His Arg His Val Glu Phe Tyr Thr Leu Ala Ser 

340 345 350 

Cys He Ala He Asp Pro Lys His Ser Ala Phe Arg Leu Gly Phe Ala 
3 ^5 360 365 

Lys Met Cys His Leu Val Thr Val Leu Asp Asp He Tyr Asp Thr Phe 
370 375 380 

Gly Thr He Asp Glu Leu Glu Leu Phe Thr Ser Ala He Lys Arq Trp 
385 390 395 400 

Asn Ser Ser Glu He Glu His Leu Pro Glu Tyr Met Lys Cys Val Tyr 

4 05 410 415 

Met Val Val Phe Glu Thr Val Asn Glu Leu Thr Arg Glu Ala Glu Lys 

42 0 425 430 

Thr Gin Gly Arg Asn Thr Leu Asn Tyr Val Arg Lys Ala Trp Glu Ala 
435 440 445 

Tyr Phe Asp Ser Tyr Met Glu Glu Ala Lys Trp He Ser Asn Gly Tyr 
450 455 460 

Leu Pro Met Phe Glu Glu Tyr His Glu Asn Gly Lys Val Ser Ser Ala 
465 470 475 480 

Tyr Arg Val Ala Thr Leu Gin Pro He Leu Thr Leu Asn Ala Trp Leu 

485 490 495 
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Pro Asp Tyr lie 

500 

Leu Ala Ser Ser 
515 

Ala Asp Arg Asp 
530 

Lys Asp Asn Pro 
545 

Ala Met Val Asn 



Ser Asn Asp Asn 

580 

Thr Arg Ala Leu 
595 

Ala Asn Lys Glu 
610 

Met Leu Phe 
625 



Leu Lys Gly lie 



Phe Leu Arg Leu 

520 

Arg Gly Glu Glu 
535 

Gly Ser Thr Glu 
550 

Asp lie lie Lys 
565 

lie Pro Met Leu 



His His Leu Tyr 

600 

Thr Lys Lys Leu 
615 



Asp Phe Pro Ser 
505 

Arg Gly Asp Thr 



Ala Ser Cys lie 

540 

Glu Asp Ala Leu 
555 

Glu Leu Asn Trp 
570 

Ala Lys Lys His 
585 

lie Tyr Arg Asp 



Val Met Glu Thr 

620 



Arg Phe Asn Asp 
510 

Arg Cys Tyr Lys 
525 

Ser Cys Tyr Met 



Asn His lie Asn 

560 

Glu Leu Leu Arg 
575 

Ala Phe Asp lie 
590 

Gly Phe Ser Val 
605 

Leu Leu Glu Ser 



<210> 3 

<211> 2018 

<212> DNA 

<213> Abies grandis 

<220> 

<221> CDS 

<222> (6) . . (1892) 

<223> Clone AG3.18 encoding pinene synthase 
<400> 3 

cagca atg get eta gtt tct acc gca ccg ttg get tec aaa tea tgc ctg 
Met Ala Leu Val Ser Thr Ala Pro Leu Ala Ser Lys Ser Cys Leu 
15 10 15 

cac aaa teg ttg ate agt tct acc cat gag ctt aag get etc tct aga 
His Lys Ser Leu lie Ser Ser Thr His Glu Leu Lys Ala Leu Ser Arg 

20 25 30 

aca att cca get eta gga atg agt agg cga ggg aaa tct ate act cct 
Thr lie Pro Ala Leu Gly Met Ser Arg Arg Gly Lys Ser lie Thr Pro 

35 40 45 

tec ate age atg age tct acc acc gtt gta acc gat gat ggt gta cga 
Ser lie Ser Met Ser Ser Thr Thr Val Val Thr Asp Asp Gly Val Arg 
50 55 60 

aga cgc atg ggc gat ttc cat tec aac etc tgg gac gat gat gtc ata 
Arg Arg Met Gly Asp Phe His Ser Asn Leu Trp Asp Asp Asp Val lie 
65 70 75 



50 



98 



146 



194 



242 



6 



WO 99/02030 



PCT/US98/14528 



ttt gga cag gac act gag aac acg aag tea tat gtg aag age aaa aaa 
Phe Gly Gin Asp Thr Glu Asn Thr Lys Ser Tyr Val Lys Ser Lvs Lvs 
290 295 300 



cag tct tta cca acg get tat gag gaa aaa teg tac ctg gag cgt get 290 

Gin Ser Leu Pro Thr Ala Tyr Glu Glu Lys Ser Tyr Leu Glu Arq Ala 

80 85 90 95 

gag aaa ctg ate ggg gaa gta aag aac atg ttc aat teg atg tea tta 338 

Glu Lys Leu He Gly Glu Val Lys Asn Met Phe Asn Ser Met Ser Leu 

100 105 no 

gaa gat gga gag tta atg agt ccg etc aat gat etc att caa cgc ctt 386 

Glu Asp Gly Glu Leu Met Ser Pro Leu Asn Asp Leu He Gin Arg Leu 

115 120 125 

tgg att gtc gac age ctt gaa cgt ttg ggg ate cat aga cat ttc aaa 434 

Trp He Val Asp Ser Leu Glu Arg Leu Gly He His Arg His Phe Lvs 

130 135 140 

gat gag ata aaa teg gcg ctt gat tat gtt tac agt tat tgg ggc gaa 482 

Asp Glu He Lys Ser Ala Leu Asp Tyr Val Tyr Ser Tyr Trp Gly Glu 

145 150 155 

aat ggc ate gga tgc ggg agg gag agt gtt gtt act gat ctg aac tea 530 

Asn Gly lie Gly Cys Gly Arg Glu Ser Val Val Thr Asp Leu Asn Ser 

160 165 170 175 

act gcg ttg ggg ctt cga ace eta cga eta cac gga tac ccg gtg tct 578 

Thr Ala Leu Gly Leu Arg Thr Leu Arg Leu His Gly Tyr Pro Val Ser 

1 Q 0 185 190 

tea gat gtt ttc aaa get ttc aaa ggc caa aat ggg cag ttt tec tgc 626 

Ser Asp Val Phe Lys Ala Phe Lys Gly Gin Asn Gly Gin Phe Ser Cys 

i95 200 205 

tct gaa aat att cag aca gat gaa gag ate aga ggc gtt ctg aat tta 674 

Ser Glu Asn He Gin Thr Asp Glu Glu He Arg Gly Val Leu Asn Leu 

210 215 220 

ttc egg gee tec etc att gee ttt cca ggg gag aaa att atg gat gag 722 

Phe Arg Ala Ser Leu He Ala Phe Pro Gly Glu Lys He Met Asp Glu 

225 230 235 

get gaa ate ttc tct ace aaa tat tta aaa gaa gec ctg caa aag att 770 

Ala Glu He Phe Ser Thr Lys Tyr Leu Lys Glu Ala Leu Gin Lys He 

240 245 250 255 

ccg gtc tec agt ctt teg cga gag ate ggg gac gtt ttg gaa tat ggt 818 

Pro Val Ser Ser Leu Ser Arg Glu He Gly Asp Val Leu Glu Tyr Gly 

260 265 270 

tgg cac aca tat ttg ccg cga ttg gaa gca agg aat tac ate caa gtc 866 

Trp His Thr Tyr Leu Pro Arg Leu Glu Ala Arg Asn Tyr He Gin Val 

275 280 285 



914 



ctt tta gaa etc gca aaa ttg gag ttc aac ate ttt caa tec tta caa 96 
Leu Leu Glu Leu Ala Lys Leu Glu Phe Asn He Phe Gin Ser Leu Gin 
305 310 315 
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aag agg gag tta gaa agt ctg gtc aga tgg tgg aaa gaa teg ggt ttt 1010 

Lys Arg Glu Leu Glu Ser Leu Val Arg Trp Trp Lys Glu Ser Gly Phe 
' i20 325 330 335 

cct gag atg acc ttc tgc cga cat cgt cac gtg gaa tac tac act ttg 1058 

Pro Glu Met Thr Phe Cys Arg His Arg His Val Glu Tyr Tyr Thr Leu 

340 345 350 

get tec tgc att gcg ttc gag cct caa cat tct gga ttc aga etc gge 1106 

Ala Ser Cys He Ala Phe Glu Pro Gin His Ser Gly Phe Arg Leu Gly 

355 360 365 



ttt gec aag acg tgt cat ett ate aeg gtt ett gac gat atg tac gae 1154 
Phe Ala Lys Thr Cys His Leu lie Thr Val Leu Asp Asp Met Tyr Asp 
370 375 3 8 o 

^ C It* ga ° 939 Ctg gaa Ctc ttc aca aca atg aag 

395 



^ y 9aC 9ag Ctg gaa Ctc ttc aca 9 C 9 atg aag 

Thr Phe Gly Thr Val Asp Glu Leu Glu Leu Phe Thr Ala Thr Met Lvs 

365 390 395 

aga tgg gat ccg tec teg ata gat tgc ett cca gaa tat atg aaa qqa 

Arg Trp Asp Pro Ser Ser He Asp Cys Leu Pro Glu Tyr Met Lys Gly 

400 405 410 41 5 

gtg tac ata gcg gtt tac gac acc gta aat gaa atg get cga gag gca 

Val Tyr He Ala Val Tyr Asp Thr Val Asn Glu Met Ala Arg Glu Ala 

420 425 430 

gag gag get caa ggc cga gat acg ctc aca tat get egg gaa get tgg 

Glu Glu Ala Gin Gly Arg Asp Thr Leu Thr Tyr Ala Arg Glu Ala Trp 

435 440 445 

gag get tat att gat teg tat atg caa gaa gca agg tgg ate gee act 

Glu Ala Tyr He Asp Ser Tyr Met Gin Glu Ala Arg Trp He Ala Thr 

450 455 46O 

ggt tac ctg ccc tee ttt gat gag tac tac gag aat ggg aaa gtt age 

Gly Tyr Leu Pro Ser Phe Asp Glu Tyr Tyr Glu Asn Gly Lys Val Ser 

465 470 475 

^ 9t ggt cgc ata tcc <? Ca "g «a ccc att ctg aca atg gac ate 

Cys Gly His Arg He Ser Ala Leu Gin Pro He Leu Thr Met Asp He 

480 485 490 P 495 

ccc ttt cct gat cat ate ctc aag gaa gtt gac ttc cca tea aag ett 

Pro Phe Pro Asp His He Leu Lys Glu Val Asp Phe Pro Ser Lys Leu 

500 505 510 

aac gac ttg gca tgt gec ate ett cga tta cga ggt gat acg egg tgc 

Asn Asp Leu Ala Cys Ala He Leu Arg Leu Arg Gly Asp Thr Arq Cys 

515 520 525 

tac aag gcg gac agg get cgt gga gaa gaa get tcc tct ata tea tgt 

lyr Lys Ala Asp Arg Ala Arg Gly Glu Glu Ala Ser Ser He Ser Cys 

530 535 540 

CCt 993 gta tCa gag gaa gat 9 ct etc gat cat 

Tyr Met Lys Asp Asn Pro Gly Val Ser Glu Glu Asp Ala Leu Asp His 

545 550 555 



1202 



1250 



1298 



1346 



1394 



1442 



1490 



1538 



1586 



1634 



16B2 
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r C ?? C 3t9 atC agt gaC gta atc aaa 99 a tta aat tgg gaa ctt 
lie Asn Ala Met lie Ser Asp Val He Lys Gly Leu Asn Trp Glu Leu 

ibU 565 570 575 



1730 



etc aaa cca gac atc aat gtt ccc atc teg gcg aag aaa cat get ttt 

Leu Lys Pro Asp He Asn Val Pro He Ser Ala Lys Lys His Ala Phe 

580 585 59Q 

gac atc gec aga get ttc cat tac ggc tac aaa tac cga gac ggc tac 

Asp He Ala Arg Ala Phe His Tyr Gly Tyr Lys Tyr Arg Asp Gly Tyr 

595 600 6 o5 

lit ?? C 9 f a aCg 339 agt ttg gtc ac 9 a <3* acc etc ctt 

Ser Val Ala Asn Val Glu Thr Lys Ser Leu Val Thr Arg Thr Leu Leu 

610 615 620 

gaa tct gtg cct ttg tag caacagctca aatctatgee etatgetatg 
Glu Ser Val Pro Leu 
625 



1 778 



1826 



1874 



192 



tcgggttaaa atatatgtgg aaggtagccg ttggatgtag aggataagtt tgttataatt 1982 
taataaagtt gtaatttaaa aaaaaaaaaa aaaaaa 2018 

<210> 4 
<211> 628 
<212> PRT 

<213> Abies grandis 
<400> 4 

Met Ala Leu Val Ser Thr Ala Pro Leu Ala Ser Lys Ser Cys Leu His 
1 5 10 15 

Lys Ser Leu He Ser Ser Thr His Glu Leu Lys Ala Leu Ser Arg Thr 

20 25 30 

He Pro Ala Leu Gly Met Ser Arg Arg Gly Lys Ser He Thr Pro Ser 
35 40 45 

He Ser Met Ser Ser Thr Thr Val Val Thr Asp Asp Gly Val Arg Arg 
^ u 55 60 

Arg Met Gly Asp Phe His Ser Asn Leu Trp Asp Asp Asp Val He Gin 
5 70 75 80 

Ser Leu Pro Thr Ala Tyr Glu Glu Lys Ser Tyr Leu Glu Arg Ala Glu 

85 90 95 

Lys Leu He Gly Glu Val Lys Asn Met Phe Asn Ser Met Ser Leu Glu 

1Q 0 105 no 

Asp Gly Glu Leu Met Ser Pro Leu Asn Asp Leu He Gin Arg Leu Trp 
115 120 125 

He Val Asp Ser Leu Glu Arg Leu Gly He His Arg His Phe Lys Asp 

^ u 135 14 o 

Glu He Lys Ser Ala Leu Asp Tyr Val Tyr Ser Tyr Trp Gly Glu Asn 
14 5 150 155 leo 
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Gly He Gly Cys Gly Arg Glu Ser Val Val Thr Asp Leu Asn Ser Thr 

165 170 175 

Ala Leu Gly Leu Arg Thr Leu Arg Leu His Gly Tyr Pro Val Ser Ser 

180 185 190 

Asp Val Phe Lys Ala Phe Lys Gly Gin Asn Gly Gin Phe Ser Cys Ser 
19S 200 205 

Glu Asn He Gin Thr Asp Glu Glu He Arg Gly Val Leu Asn Leu Phe 
210 215 220 

Arg Ala Ser Leu He Ala Phe Pro Gly Glu Lys He Met Asp Glu Ala 
225 230 235 240 

Glu He Phe Ser Thr Lys Tyr Leu Lys Glu Ala Leu Gin Lys He Pro 

245 250 255 

Val Ser Ser Leu Ser Arg Glu He Gly Asp Val Leu Glu Tyr Glv Tro 

260 265 270 

His Thr Tyr Leu Pro Arg Leu Glu Ala Arg Asn Tyr He Gin Val Phe 
275 280 285 

Gly Gin Asp Thr Glu Asn Thr Lys Ser Tyr Val Lys Ser Lys Lys Leu 
290 295 300 

Leu Glu Leu Ala Lys Leu Glu Phe Asn He Phe Gin Ser Leu Gin Lvs 
305 310 315 320 

Arg Glu Leu Glu Ser Leu Val Arg Tip Trp Lys Glu Ser Gly Phe Pro 

325 330 335 

Glu Met Thr Phe Cys Arg His Arg His Val Glu Tyr Tyr Thr Leu Ala 

340 345 350 

Ser Cys He Ala Phe Glu Pro Gin His Ser Gly Phe Arg Leu Gly Phe 
355 360 365 

Ala Lys Thr Cys His Leu He Thr Val Leu Asp Asp Met Tyr Asp Thr 
370 375 380 

Phe Gly Thr Val Asp Glu Leu Glu Leu Phe Thr Ala Thr Met Lys Ara 
385 39 ° 395 400 

Trp Asp Pro Ser Ser He Asp Cys Leu Pro Glu Tyr Met Lys Gly Val 

405 410 415 

Tyr He Ala Val Tyr Asp Thr Val Asn Glu Met Ala Arg Glu Ala Glu 

420 425 430 

Glu Ala Gin Gly Arg Asp Thr Leu Thr Tyr Ala Arg Glu Ala Trp Glu 
43 $ 440 445 

Ala Tyr He Asp Ser Tyr Met Gin Glu Ala Arg Trp He Ala Thr Gly 
450 455 460 

Tyr Leu Pro Ser Phe Asp Glu Tyr Tyr Glu Asn Gly Lys Val Ser Cys 
465 470 475 480 
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Gly His Arg He Ser Ala Leu Gin Pro He Leu Thr Met Asp He Pro 

485 495 

Phe Pro Asp His He Leu Lys Glu Val Asp Phe Pro Ser Lys Leu Asn 

500 505 510 

Asp Leu Ala Cys Ala He Leu Arg Leu Arg Gly Asp Thr Arg Cys Tyr 
ol5 520 525 

Lys Ala Asp Arg Ala Arg Gly Glu Glu Ala Ser Ser He Ser Cys Tvr 

530 535 540 y y 

Met Lys Asp Asn Pro Gly Val Ser Glu Glu Asp Ala Leu Asp His He 

550 555 560 

Asn Ala Met He Ser Asp Val He Lys Gly Leu Asn Trp Glu Leu Leu 

565 570 575 

Lys Pro Asp He Asn Val Pro He Ser Ala Lys Lys His Ala Phe Asp 

580 ses 5go 

He Ala Arg Ala Phe His Tyr Gly Tyr Lys Tyr Arg Asp Gly Tyr Ser 
595 6 oo 6 05 

Val Ala Asn Val Glu Thr Lys Ser Leu Val Thr Arg Thr Leu Leu Glu 
610 615 



620 



Ser Val Pro Leu 
625 



<210> 5 
<211> 2089 
<212> DNA 

<213> Abies grandis 

<220> 

<221> CDS 

<222> (73).. (1986) 

<223> Clone AGIO encoding limonene synthase 
<400> 5 

tgccgtttaa tcggtttaaa gaagctacca tagttcggtt taaagaagct accatagttt 60 

aggcaggaat cc atg get etc ctt tct ate gta tct ttg cag gtt ccc aaa 111 

Met Ala Leu Leu Ser He Val Ser Leu Gin Val Pro Lvs 
1 5 10 

S£r r?,« 2?* aa .! . ttg a f c ~ gt tcc agc aat gtg ca 3 a *g g ct 159 

Lsn 
25 



= " 3 y ^-'-y aLI - <»gt tcc age aat gtg cag aag act 

Ser Cys Gly Leu Lys Ser Leu He Ser Ser Ser Asn Val Gin Lvs Ala 
15 20 - 



flT, r 9t ty° *" 9Ca gt ° ° Ca aCa Ctc at 9 «=gt agg cga cag 207 

Leu Cys He Ser Thr Ala Val Pro Thr Leu Arg Met Arg Arg Arg Gin 

JU 35 40 45 

aaa get ctg gtc ate aac atg aaa ttg acc act gta tcc cat cgt gat 255 

Lys Ala Leu Val He Asn Met Lys Leu Thr Thr Val Ser His Arq Asn 

50 55 eo 



11 



WO 99/02030 



PCT/US98/14528 



gat aat ggt ggt ggt gta ctg caa aga cgc ata gcc gat cat cat ccc 303 
Asp Asn Gly Gly Gly Val Leu Gin Arg Arg He Ala Asp His His Pro 

65 70 75 



aac ctg tgg gaa gat gat ttc ata caa tea ttg tec tea cct tat ggg 

Asn Leu Trp Glu Asp Asp Phe He Gin Ser Leu Ser Ser Pro Tyr Gly 
80 85 90 

gga tct teg tac agt gaa cgt get gag aca gtc gtt gag gaa gta aaa 

Gly Ser Ser Tyr Ser Glu Arg Ala Glu Thr Val Val Glu Glu Val Lys 

95 100 105 



ate act aga agt gtt eta aat tta tat egg get tec ctg gtc gcc ttt 

He Thr Arg Ser Val Leu Asn Leu Tyr Arg Ala Ser Leu Val Ala Phe 

225 230 235 

ccc ggg gag aaa gtt atg gaa gag get gaa ate ttc teg gca tct tat 

Pro Gly Glu Lys Val Met Glu Glu Ala Glu He Phe Ser Ala Ser Tyr 
240 245 250 



351 



399 



gag atg ttc aat tea ata cca aat aat aga gaa tta ttt ggt tec caa 447 
Glu Met Phe Asn Ser He Pro Asn Asn Arg Glu Leu Phe Gly Ser Gin 
110 115 120 125 

aat gat etc ctt aca cgc ctt tgg atg gtg gat age att gaa cgt ctg 495 
Asn Asp Leu Leu Thr Arg Leu Trp Met Val Asp Ser He Glu Arg Leu 

130 135 140 

ggg ata gat aga cat ttc caa aat gag ata aga gta gcc etc gat tat 543 
Gly He Asp Arg His Phe Gin Asn Glu He Arg Val Ala Leu Asp Tyr 

145 150 155 

gtt tac agt tat tgg aag gaa aag gaa ggc att ggg tgt ggc aga gat 591 
Val Tyr Ser Tyr Trp Lys Glu Lys Glu Gly He Gly Cys Gly Arg Asp 
160 165 170 

tct act ttt cct gat etc aac teg act gcc ttg gcg ctt cga act ctt 639 
Ser Thr Phe Pro Asp Leu Asn Ser Thr Ala Leu Ala Leu Arg Thr Leu 
175 180 185 

cga ctg cac gga tac aat gtg tct tea gat gtg ctg gaa tac ttc aaa 687 
Arg Leu His Gly Tyr Asn Val Ser Ser Asp Val Leu Glu Tyr Phe Lys 
190 195 200 205 

gat gaa aag ggg cat ttt gcc tgc cct gca ate eta ace gag gga cag 735 
Asp Glu Lys Gly His Phe Ala Cys Pro Ala He Leu Thr Glu Gly Gin 

210 215 220 



783 



831 



ttg aaa aaa gtc tta caa aag att ccg gtc tec aat ctt tea gga gag 879 
Leu Lys Lys Val Leu Gin Lys He Pro Val Ser Asn Leu Ser Gly Glu 
255 260 265 

ata gaa tat gtt ttg gaa tat ggt tgg cac acg aat ttg ccg aga ttg 927 
He Glu Tyr Val Leu Glu Tyr Gly Trp His Thr Asn Leu Pro Arg Leu 
270 275 280 285 

gaa gca aga aat tat ate gag gtc tac gag cag age ggc tat gaa age 975 
Glu Ala Arg Asn Tyr He Glu Val Tyr Glu Gin Ser Gly Tyr Glu Ser 

290 295 300 
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llu Asn r? 9 »" 5" at9 3tg aag aag ctt tta caa ctt gca 1023 

Leu Asn Glu Met Pro Tyr Met Asn Met Lys Lys Leu Leu Gin Leu Ala 

305 310 315 

aaa ttg gag ttc aat ate ttt cac tct ttg caa eta aga gag tta caa 1071 
Lys Leu Glu Phe Asn He Phe His Ser Leu Gin Leu Arg Glu Leu Gin 
320 325 33 0 

tct ate tec aga tgg tgg aaa gaa tea ggt teg tct caa ctg act ttt 1119 
Ser lie Ser Arg Trp Trp Lys Glu Ser Gly Ser Ser Gin Leu Thr Phe 
335 340 345 

aca egg cat cgt cac gtg gaa tac tac act atg gca tct tqc att tct n fi7 
Thr Arg His Arg His Val Glu Tyr Tyr Thr Met Ada Ser Cys lie Ser 

355 360 3 6 5 

atg ttg cca aaa cat tea get ttc aga atg gag ttt gtc aaa gtg tgt 1215 
Met Leu Pro Lys His Ser Ala Phe Arg Met Glu Phe Val Lys Val Cys 

370 375 380 

His Leu Val Thr V^ *t* ^ ^ 3Ct aCa ^ 1263 

Leu Val Thr Val L eu Asp Asp He Tyr Asp Thr Phe Gly Thr Met 

385 390 3g5 

aac gaa etc caa ctt ttt acg gat gca att aag aga tgg gat ttg tea 1311 

1U ^ LeU PhC Thr AS P ^ a Ile Arg Trp Asp Leu Ser 

400 405 410 

Thr TKr 3 " l" T Ctt CCa 933 tat atg aaa gga gt 9 tac at 9 9 ac ttg 1359 
Thr Thr Arg Trp Leu Pro Glu Tyr Met Lys Gly Val Tyr Met Asp Leu 

415 420 425 

III r? C ^ ^f 3 at9 9tg 933 gag gCt gag aa « act caa Sec 1407 

Tyr Gin Cys He Asn Glu Met Val Glu Glu Ala Glu Lys Thr Gin Gly 

JU 4 35 440 445 

t g * ? at f tc aac tat att caa aat g ct tgg g aa g cc =ta ttt gat 1455 

Arg Asp Met Leu Asn Tyr He Gin Asn Ala Trp Glu Ala Leu Phe Asp 

450 455 46O 

Sf* 9 f a 9Ca 339 tgg atC tCC a 9 c a gt tat etc cca acg 1503 
Thr Phe Met Gin Glu Ala Lys Trp He Ser Ser Ser Tyr Leu Pro Thr 

465 470 475 

Ph! r? 9 Z* C ttg 339 aat 9Ca aaa gtt a g fc tct ggt tct cgc ata 1551 

Phe Glu Glu Tyr Leu Lys Asn Ala Lys Val Ser Ser Gly Ser Arg He 

480 485 490 

a?* Th 3 t" 3 CCC att CtC aCt ttg gat gta cca ctt cct gat tac 1599 

Ala Thr Leu Gin Pro He Leu Thr Leu Asp Val Pro Leu Pro Xsp Tyr 

495 500 505 

ata ctg caa gaa att gat tat cca tec aga ttc aat gag tta get teg 1647 
lie Leu Gin Glu He Asp Tyr Pro Ser Arg Phe Asn Glu Leu Ala Ser 
510 515 520 525 

tec ate ctt cga eta cga ggt gac acg cgc tgc tac aag gcg gat agg 1695 
Ser He Leu Arg Leu Arg Gly Asp Thr Arg Cys Tyr Lys Ala Asp Arg 

53 0 535 5 4 0 
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gcc cgt gga gaa gaa get tea get ata teg tgt tat atg aaa gac cat 1743 
Ala Arg Gly Glu Glu Ala Ser Ala lie Ser Cys Tyr Met Lys Asp His 

545 550 555 

cct gga tea ata gag gaa gat get etc aat cat ate aac gcc atg ate 1791 
Pro Gly Ser lie Glu Glu Asp Ala Leu Asn His lie Asn Ala Met lie 
560 565 570 

agt gat gca ate aga gaa tta aat tgg gag ctt etc aga ccg gat age 1839 
Ser Asp Ala lie Arg Glu Leu Asn Trp Glu Leu Leu Arg Pro Asp Ser 
575 580 585 

aaa agt ccc ate tct tec aag aaa cat get ttt gac ate ace aga get 1887 
Lys Ser Pro lie Ser Ser Lys Lys His Ala Phe Asp lie Thr Arg Ala 
590 595 600 605 

ttc cat cat gtc tac aaa tat cga gat ggt tac act gtt tec aac aae 1935 
Phe His His Val Tyr Lys Tyr Arg Asp Gly Tyr Thr Val Ser Asn Asn 

610 615 620 

gaa aca aag aat ttg gtg atg aaa ace gtt ctt gaa cct etc get ttg 1983 
Glu Thr Lys Asn Leu Val Met Lys Thr Val Leu Glu Pro Leu Ala Leu 

625 630 635 

taa aaacatatag aatgcattaa aatgtgggaa gtctataatc tagactattc 2036 
tctatctttc ataatgtaga tctggatgtg tattgaactc taaaaaaaaa aaa 2089 

<210> 6 
<211> 637 
<212> PRT 

<213> Abies grandis 
<400> 6 

Met Ala Leu Leu Ser lie Val Ser Leu Gin Val Pro Lys Ser Cys Gly 
15 10 15 

Leu Lys Ser Leu lie Ser Ser Ser Asn Val Gin Lys Ala Leu Cys lie 

20 25 30 

Ser Thr Ala Val Pro Thr Leu Arg Met Arg Arg Arg Gin Lys Ala Leu 
35 40 45 

Val lie Asn Met Lys Leu Thr Thr Val Ser His Arg Asp Asp Asn Gly 
50 55 60 

Gly Gly Val Leu Gin Arg Arg lie Ala Asp His His Pro Asn Leu Trp 
65 70 75 80 

Glu Asp Asp Phe lie Gin Ser Leu Ser Ser Pro Tyr Gly Gly Ser Ser 

85 90 95 

Tyr Ser Glu Arg Ala Glu Thr Val Val Glu Glu Val Lys Glu Met Phe 

100 105 110 

Asn Ser lie Pro Asn Asn Arg Glu Leu Phe Gly Ser Gin Asn Asp Leu 
115 120 125 
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Leu Thr Arg Leu 
130 

Arg His Phe Gin 
145 



Tyr Trp Lys Glu 



Pro Asp Leu Asn 

180 

Gly Tyr Asn Val 
195 

Gly His Phe Ala 
210 

Ser Val Leu Asn 
225 

Lys Val Met Glu 



Val Leu Gin Lys 

260 

Val Leu Glu Tyr 
275 

Asn Tyr lie Glu 
290 

Met Pro Tyr Met 
305 

Phe Asn lie Phe 



Arg Trp Trp Lys 

340 

Arg His Val Glu 

355 

Lys His Ser Ala 
370 

Thr Val Leu Asp 
385 

Gin Leu Phe Thr 



Trp Leu Pro Glu 

420 

lie Asn Glu Met 
435 



Trp Met Val Asp 
135 

Asn Glu lie Arg 
150 

Lys Glu Gly He 
165 

Ser Thr Ala Leu 



Ser Ser Asp Val 

200 

Cys Pro Ala He 
215 

Leu Tyr Arg Ala 

230 

Glu Ala Glu He 
245 

He Pro Val Ser 



Gly Trp His Thr 

280 

Val Tyr Glu Gin 
295 

Asn Met Lys Lys 
310 

His Ser Leu Gin 
325 

Glu Ser Gly Ser 



Tyr Tyr Thr Met 

360 

Phe Arg Met Glu 
375 

Asp He Tyr Asp 
390 

Asp Ala He Lys 
405 

Tyr Met Lys Gly 



Val Glu Glu Ala 

440 



Ser He Glu Arg 

140 

Val Ala Leu Asp 
155 

Gly Cys Gly Arg 
170 

Ala Leu Arg Thr 
185 

Leu Glu Tyr Phe 



Leu Thr Glu Gly 

220 

Ser Leu Val Ala 
235 

Phe Ser Ala Ser 
250 

Asn Leu Ser Gly 
265 

Asn Leu Pro Arg 



Ser Gly Tyr Glu 

300 

Leu Leu Gin Leu 

315 

Leu Arg Glu Leu 
330 

Ser Gin Leu Thr 
345 

Ala Ser Cys He 



Phe Val Lys Val 

380 

Thr Phe Gly Thr 
395 

Arg Trp Asp Leu 
410 

Val Tyr Met Asp 
425 

Glu Lys Thr Gin 



Leu Gly He Asp 



Tyr Val Tyr Ser 

160 

Asp Ser Thr Phe 
175 

Leu Arg Leu His 
190 

Lys Asp Glu Lys 
205 

Gin He Thr Arg 

Phe Pro Gly Glu 

240 

Tyr Leu Lys Lys 
255 

Glu He Glu Tyr 
270 

Leu Glu Ala Arg 
285 

Ser Leu Asn Glu 



Ala Lys Leu Glu 

320 

Gin Ser He Ser 
335 

Phe Thr Arg His 
350 

Ser Met Leu Pro 
365 

Cys His Leu Val 



Met Asn Glu Leu 

400 

Ser Thr Thr Arg 
415 

Leu Tyr Gin Cys 
430 

Gly Arg Asp Met 
445 
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Leu Asn Tyr He Gin Asn Ala Trp Glu Ala Leu Phe Asp Thr Phe Met 
450 455 460 

Gin Glu Ala Lys Trp He Ser Ser Ser Tyr Leu Pro Thr Phe Glu Glu 
465 470 475 48 o 

Tyr Leu Lys Asn Ala Lys Val Ser Ser Gly Ser Arg He Ala Thr Leu 

485 490 495 

Gin Pro He Leu Thr Leu Asp Val Pro Leu Pro Asp Tyr He Leu Gin 

500 505 510 

Glu He Asp Tyr Pro Ser Arg Phe Asn Glu Leu Ala Ser Ser He Leu 
515 520 525 

Arg Leu Arg Gly Asp Thr Arg Cys Tyr - Lys Ala Asp Arg Ala Arg Gly 
530 535 540 Y 

Glu Glu Ala Ser Ala He Ser Cys Tyr Met Lys Asp His Pro Gly Ser 

550 555 560 

He Glu Glu Asp Ala Leu Asn His He Asn Ala Met He Ser Asp Ala 

565 570 575 

He Arg Glu Leu Asn Trp Glu Leu Leu Arg Pro Asp Ser Lys Ser Pro 

580 5 8 5 59Q 

He Ser Ser Lys Lys His Ala Phe Asp He Thr Arg Ala Phe His His 
595 600 60S 

Val Tyr Lys Tyr Arg Asp Gly Tyr Thr Val Ser Asn Asn Glu Thr Lys 
610 615 620 

Asn Leu Val Met Lys Thr Val Leu Glu Pro Leu Ala Leu 
625 630 635 



<210> 7 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerate 
oligonucleotide PCR primer A wherein the letter 



indicates an inosine residue 



<220> 

<221> mis c_f eature 
<222> (1) . . (25) 

<223> Degenerate oligonucleotide Primer A wherein n 
represents inosine 

<400> 7 

arraygarra nggnrartay aarga 



25 



<210> 8 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 

oligonucleotide PCR primer B wherein the letter 
"n M represents an inosine residue 

<220> 

<221> misc_f eature 
<222> (1) . . (20) 

<223> oligonucleotide PCR primer B wherein the letter n 
represents an inosine residue 

<400> 8 

atgytncary tntaygargc 



<210> 9 
<211> 24 
<2 12 > DNA 

<2 13> Ar ti f icial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 

oligonucleotide PCR primer C wherein the letter 
"n" represents an inosine residue 

<220> 

<221> misc__f eature 
<222> (1) . . (24) 

<223> PCR primer C wherein the letter n represents 
i nosine 

<400> 9 

ctnkynrang gnctratrta ckty 



<210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 
oligonucleotide PCR primer D wherein the letter 
"n" represents an inosine residue 

<220> 

<221> misc^feature 
<222> (1) . . (23) 

<223> PCR primer D wherein the letter n represents 
inosine 

<400> 10 

gaygaynnnt wygaygcnya ygg 



<210> 11 
<211> 108 
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<212> DNA 

<213> Artificial Sequence 
<400> 11 

gatgatgggt ttgatgcgca cggaacccta gatgaattga agctattcac tgaggctgtg 60 
agaagatggg acctctcctt tacagacaac ttccccgatt acatgaaa 108 

<210> 12 
<211> 104 
<212> DNA 

<213> Abies grandis 
<400> 12 

gacgacgggt atgatgcgca tggaacgatt gacgagcttg aactcttcac atctgcaatt 60 
aagagatgga attcatcaga gatagacagc ttccccgact atat 104 

<210> 13 
<211> 105 
<212> DNA 

<213> Abies grandis 
<400> 13 

gatgatgggt atgatgcgta cggaacgttg gaagaaatca aaatcatgac agagggagtg 60 
agacgatggg atctttcgtt gaccgcttnc cccgactata tgaaa 105 

<210> 14 
<211> 117 
<212> DNA 

<213> Abies grandis 
<400> 14 

gacgatgggt atgatgcgca tggaaccttg gaccaactca aaatctttac agagggagtg 60 
agacgatggg atgtttcgtt ggtagaccac ttnccccgac tacatgcaat ctagacc 117 

<210> 15 
<211> 2424 
<212> DNA 

<213> Abies grandis 

<220> 

<221> CDS 

<222> (2) . . (2350) 

<223> Clone AG1.28 

<400> 15 

g ggt tat gat ctt gtg cat tct ctt aaa tea cct tat att gat tct agt 49 

Gly Tyr Asp Leu Val His Ser Leu Lys Ser Pro Tyr He Asp Ser Ser 

5 10 15 

tac aga gaa cgc gcg gag gtc ctt gtt age gag att aaa gtg atg ctt 97 
Tyr Arg Glu Arg Ala Glu Val Leu Val Ser Glu He Lys Val Met Leu 

20 25 30 
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aat cca get att aca gga gat gga gaa tea atg att act cca tct act 

Asn Pro Ala He Thr Gly Asp Gly Glu Ser Met He Thr Pro Ser Ala 

35 40 45 

tat gac aca gca tgg gta gcg agg gtg ccc gec att gat ggc tct qct 

Tyr Asp Thr Ala Trp Val Ala Arg Val Pro Ala He Asp Gly Ser Ala 

50 55 60 



cgc ccg caa ttt ccc caa aca gtt gac tgg att ttg aaa aac cag tta 
Arg Pro Gin Phe Pro Gin Thr Val Asp Trp He Leu Lys Asn Gin Leu 
65 70 ' 75 eo 



aaa gat ggt tea tgg g ga att cag tec cac ttt ctg ctg tec gac cat 

Lys Asp Gly Ser Trp Gly He Gin Ser His Phe Leu Leu Ser Asp Ara 

85 90 95 y 

ctt ctt gee act ctt tct tgt gtt ctt gtg etc ctt aaa tgg aac gtt 

Leu Leu Ala Thr Leu Ser Cys Val Leu Val Leu Leu Lys Trp Asn Val 

100 105 no 

ggg gat ctg caa gta gag cag gga at t gaa ttc ata aag age aat ctq 

Gly Asp Leu Gin Val Glu Gin Gly He Glu Phe He Lys Ser Asn Leu 

H5 120 125 



gag ate ata ttt cct tct ctg tta aga gaa get caa tct ctg cgc etc 
Glu He He Phe Pro Ser Leu Leu Arg Glu Ala Gin Ser Leu Arg Leu 
145 150 



155 160 



145 



193 



241 



289 



337 



385 



gaa eta gta aag gat gaa ace gat caa gat age ttg gta aca gac ttt 433 
Glu Leu Val Lys Asp Glu Thr Asp Gin Asp Ser Leu Val Thr Asp Phe 
130 135 140 



481 



gga ctt ccc tac gac ctg cct tat ata cat ctg ttg cag act aaa egg 529 

Gly Leu Pro Tyr Asp Leu Pro Tyr He His Leu Leu Gin Thr Lys Arg 

165 170 175 

cag gaa aga tta gca aaa ctt tea agg gag gaa att tat gcg gtt ccg 577 

Gin Glu Arg Leu Ala Lys Leu Ser Arg Glu Glu He Tyr Ala Val Pro 

I 80 185 190 

teg cca ttg ttg tat tct tta gag gga ata caa gat ata gtt gaa tgq 625 

Ser Pro Leu Leu Tyr Ser Leu Glu Gly He Gin Asp He Val Glu Trp 

195 200 205 

gaa cga ata atg gaa gtt caa agt cag gat ggg tct ttc tta age tea 673 

Glu Arg He Met Glu Val Gin Ser Gin Asp Gly Ser Phe Leu Ser Ser 

210 215 220 

cct get tct act gee tgc gtt ttc atg cac aca gga gac gcg aaa tgc 721 

Pro Ala Ser Thr Ala Cys Val Phe Met His Thr Gly Asp Ala Lys Cvs 

225 230 235 240 

ctt gaa ttc ttg aac agt gtg atg ate aag ttt gga aat ttt gtt ccc 769 

Leu Glu Phe Leu Asn Ser Val Met He Lys Phe Gly Asn Phe Val Pro 

245 250 255 

tgc ctg tat cct gtg gat ctg ctg gaa cgc ctg ttg ate gta gat aat 817 

Cys Leu Tyr Pro Val Asp Leu Leu Glu Arg Leu Leu He Val Asp Asn 

26 ° 265 270 
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att gta cgc ctt gga ate tat aga cac ttt gaa aag gaa ate aag gaa 
lie Val Arg Leu Gly He Tyr Arg His Phe Glu Lys Glu He Lys Glu 
275 280 285 



865 



get ctt gat tat gtt tac agg cat tgg aac gaa aga gga att ggg tgg 
Ala Leu Asp Tyr Val Tyr Arg His Trp Asn Glu Arg Gly He Gly Trp 
290 295 300 



913 



ggc aga eta aat ccc ata gca gat ctt gag acc act get ttg gga ttt 961 

Gly Arg Leu Asn Pro He Ala Asp Leu Glu Thr Thr Ala Leu Gly Phe 
305 310 315 320 

cga ttg ctt egg ctg cat agg tac aat gta tct cca gee att ttt gac 1009 

Arg Leu Leu Arg Leu His Arg Tyr Asn Val Ser Pro Ala He Phe Asp 

325 330 335 

aac ttc aaa gat gec aat ggg aaa ttc att tgc teg acc ggt caa ttc 1057 

Asn Phe Lys Asp Ala Asn Gly Lys Phe He Cys Ser Thr Gly Gin Phe 

340 345 350 

aac aaa gat gta gca age atg ctg aat ctt tat aga get tec cag etc 1105 

Asn Lys Asp Val Ala Ser Met Leu Asn Leu Tyr Arg Ala Ser Gin Leu 
355 360 365 

gca ttt ccc gga gaa aac att ctt gat gaa get aaa age ttc get act 1153 

Ala Phe Pro Gly Glu Asn He Leu Asp Glu Ala Lys Ser Phe Ala Thr 

3*70 375 380 

aaa tat ttg aga gaa get ctt gag aaa agt gag act tec agt gca tgg 1201 

Lys Tyr Leu Arg Glu Ala Leu Glu Lys Ser Glu Thr Ser Ser Ala Trp 
385 390 395 400 

aac aac aaa caa aac ctg age caa gag ate aaa tac gcg ctg aag act 1249 

Asn Asn Lys Gin Asn Leu Ser Gin Glu He Lys Tyr Ala Leu Lys Thr 

405 410 415 

tct tgg cat gec agt gtt ccg aga gtg gaa gca aag aga tac tgt caa 1297 

Ser Trp His Ala Ser Val Pro Arg Val Glu Ala Lys Arg Tyr Cys Gin 

420 425 430 

gtg tat cgc cca gat tat gca cgc ata gca aaa tgc gtt tac aag eta 

Val Tyr Arg Pro Asp Tyr Ala Arg He Ala Lys Cys Val Tyr Lys Leu 
435 440 445 

ccc tac gtg aac aat gaa aag ttt tta gag ctg gga aaa tta gat ttc 1393 
Pro Tyr Val Asn Asn Glu Lys Phe Leu Glu Leu Gly Lys Leu Asp Phe 

450 455 460 



1345 



aac att ate cag tec ate cac caa gaa gaa atg aag aat gtt acc age 1441 

Asn He He Gin Ser He His Gin Glu Glu Met Lys Asn Val Thr Ser 

465 470 475 480 

tgg ttt aga gat teg ggg ttg cca eta ttc acc ttc get egg gag agg 1489 

Trp Phe Arg Asp Ser Gly Leu Pro Leu Phe Thr Phe Ala Arg Glu Arg 

485 490 495 

ccg ctg gaa ttc tac ttc tta gta gcg gcg ggg acc tat gaa ccc cag 1537 

Pro Leu Glu Phe Tyr Phe Leu Val Ala Ala Gly Thr Tyr Glu Pro Gin 

500 505 510 
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tat gcc aaa 
Tyr Ala Lys 
515 

gtt ctg gac 
Val Leu Asp 

530 

eta ttc act 

Leu Phe Thr 
545 

ctt cca gac 

Leu Pro Asp 



cac gag gtg 
His Glu Val 



age ttt ttc 
Ser Phe Phe 
595 

gaa get gaa 
Glu Ala Glu 

610 

ata aag aat 
lie Lys Asn 
625 

gga gtg ttg 
Gly Val Leu 



aaa gta gat 
Lys Val Asp 



att tec cgc 
He Ser Arg 
675 

cgt gga gaa 
Arg Gly Glu 
690 

gaa tgt aca 

Glu Cys Thr 
705 

ccg gcg gtg 

Pro Ala Val 



cca ttc gcc 
Pro Phe Ala 



tgc agg ttc 
Cys Arg Phe 



gat atg tat 
Asp Met Tyr 



gag get gtg 
Glu Ala Val 
550 

tat atg aaa 
Tyr Met Lys 
565 

get tgg gag 

Ala Trp Glu 
580 

aga aag gga 

Arg Lys Gly 



tgg tta get 
Trp Leu Ala 



gga ate aca 
Gly He Thr 
630 

ata atg gat 
lie Met Asp 
645 

tat cca gga 
Tyr Pro Gly 
660 

ctg gcg gat 
Leu Ala Asp 



ttg gcg tec 
Leu Ala Ser 



gag gaa gag 

Glu Glu Glu 
710 

aag gaa ctg 

Lys Glu Leu 
725 

tgc aag aag 

Cys Lys Lys 
740 



etc ttt aca 

Leu Phe Thr 
520 

gac act tat 

Asp Thr Tyr 
535 

aga aga tgg 

Arg Arg Trp 



eta tgt tac 
Leu Cys Tyr 



gca gag aag 
Ala Glu Lys 

585 

tgg gag gat 
Trp Glu Asp 
600 

get gag tat 

Ala Glu Tyr 
615 

tct ate ggc 

Ser He Gly 



ggg caa etc 
Gly Gin Leu 



aga cgt gtt 

Arg Arg Val 
665 

gac acg aag 

Asp Thr Lys 
680 

age att gaa 

Ser He Glu 
695 

get etc gat 

Ala Leu Asp 

aca aga gag 

Thr Arg Glu 



atg ctt ttc 
Met Leu Phe 
745 



aaa gtg gca 
Lys Val Ala 



gga ace eta 

Gly Thr Leu 
540 

gac etc tec 

Asp Leu Ser 
555 

caa ate tat 

Gin He Tyr 
570 

gaa cag ggg 

Glu Gin Gly 



tat ctt ctg 
Tyr Leu Leu 



gtg cct acc 
Val Pro Thr 
620 

caa cgt ata 
Gin Arg He 
635 

ctt teg caa 
Leu Ser Gin 
650 

etc aca gag 
Leu Thr Glu 



aca tat aaa 
Thr Tyr Lys 



tgt tac atg 
Cys Tyr Met 
700 

cac ate tat 
His He Tyr 
715 

ttt ctg aag 
Phe Leu Lys 
730 

gag gag aca 
Glu Glu Thr 
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tgc ttg cag 

Cys Leu Gin 
525 

gat gaa ttg 

Asp Glu Leu 



ttt aca gaa 
Phe Thr Glu 



tat gac ata 
Tyr Asp He 
575 

cgt gaa ttg 
Arg Glu Leu 
590 

ggt tat tat 

Gly Tyr Tyr 
605 

ttg gac gag 

Leu Asp Glu 



ctt ctg ttg 
Leu Leu Leu 



gag gca tta 
Glu Ala Leu 
655 

ctg aat age 
Leu Asn Ser 
670 

get gag aag 
Ala Glu Lys 
685 

aaa gac cat 
Lys Asp His 



age att ctg 
Ser lie Leu 



ccc gac gac 
Pro Asp Asp 
735 

aga gtg acg 
Arg Val Thr 
750 



act 1585 
Thr 



aag 1633 
Lys 



aac 1681 

Asn 

560 

gtt 1729 
Val 



gtc 1777 
Val 



gaa 1825 
Glu 



tac 1873 
Tyr 



agt 1921 

Ser 

640 

gag 1969 
Glu 



etc 2017 
Leu 



get 2065 
Ala 



cct 2113 
Pro 



gag 2161 

Glu 

720 

gtc 2209 
Val 



atg 2257 
Met 
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gtg ata ttc aag gat gga gat gga ttc ggt gtt tec aaa tta gaa gtc 2305 

Val lie Phe Lys Asp Gly Asp Gly Phe Gly Val Ser Lys Leu Glu Val 

755 760 765 

aaa gat cat ate aaa gag tgt etc att gaa ccg ctg cca ctg taa ^350 

Lys Asp His lie Lys Glu Cys Leu He Glu Pro Leu Pro Leu 

770 775 780 

tcaaaatagt tgcaataata attgaaataa tgtcaactat gtttcacaaa aaaaaaaaaa 2410 
aaaaaaaaaa aaaa 



2424 



<210> 16 
<211> 782 
<212> PRT 

<213> Abies grandis 



<400> 16 

Gly Tyr Asp Leu Val His Ser Leu Lys Ser Pro Tyr He Asp Ser Ser 
1 5 io 



15 



Tyr Arg Glu Arg Ala Glu Val Leu Val Ser Glu He Lys Val Met Leu 

20 25 30 

Asn Pro Ala He Thr Gly Asp Gly Glu Ser Met He Thr Pro Ser Ala 
35 40 45 

Tyr Asp Thr Ala Trp Val Ala Arg Val Pro Ala He Asp Gly Ser Ala 
50 55 60 

Arg Pro Gin Phe Pro Gin Thr Val Asp Trp He Leu Lys Asn Gin Leu 
65 70 75 80 

Lys Asp Gly Ser Trp Gly He Gin Ser His Phe Leu Leu Ser Asp Ara 

85 90 95 

Leu Leu Ala Thr Leu Ser Cys Val Leu Val Leu Leu Lys Trp Asn Val 

100 105 no 

Gly Asp Leu Gin Val Glu Gin Gly He Glu Phe He Lys Ser Asn Leu 
115 120 125 

Glu Leu Val Lys Asp Glu Thr Asp Gin Asp Ser Leu Val Thr Asp Phe 
I 30 135 140 

Glu He He Phe Pro Ser Leu Leu Arg Glu Ala Gin Ser Leu Arg Leu 
145 150 155 160 

Gly Leu Pro Tyr Asp Leu Pro Tyr He His Leu Leu Gin Thr Lys Arg 

165 170 175 

Gin Glu Arg Leu Ala Lys Leu Ser Arg Glu Glu He Tyr Ala Val Pro 

180 185 190 

Ser Pro Leu Leu Tyr Ser Leu Glu Gly He Gin Asp He Val Glu Trp 
195 200 205 

Glu Arg He Met Glu Val Gin Ser Gin Asp Gly Ser Phe Leu Ser Ser 
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210 215 220 

Pro Ala Ser Thr Ala Cys Val Phe Met His Thr Gly Asp Ala Lys Cys 
225 230 235 240 

Leu Glu Phe Leu Asn Ser Val Met lie Lys Phe Gly Asn Phe Val Pro 

245 250 255 

Cys Leu Tyr Pro Val Asp Leu Leu Glu Arg Leu Leu He Val Asp Asn 

260 265 270 

He Val Arg Leu Gly He Tyr Arg His Phe Glu Lys Glu He Lys Glu 
275 280 285 

Ala Leu Asp Tyr Val Tyr Arg His Trp Asn Glu Arg Gly He Gly Trp 
290 295 300 

Gly Arg Leu Asn Pro He Ala Asp Leu Glu Thr Thr Ala Leu Gly Phe 
305 310 315 320 

Arg Leu Leu Arg Leu His Arg Tyr Asn Val Ser Pro Ala He Phe Asp 

325 330 335 

Asn Phe Lys Asp Ala Asn Gly Lys Phe He Cys Ser Thr Gly Gin Phe 

340 345 350 

Asn Lys Asp Val Ala Ser Met Leu Asn Leu Tyr Arg Ala Ser Gin Leu 
355 360 365 

Ala Phe Pro Gly Glu Asn He Leu Asp Glu Ala Lys Ser Phe Ala Thr 
310 375 3eo 

Lys Tyr Leu Arg Glu Ala Leu Glu Lys Ser Glu Thr Ser Ser Ala Trp 
385 390 395 400 

Asn Asn Lys Gin Asn Leu Ser Gin Glu He Lys Tyr Ala Leu Lys Thr 

405 410 415 

Ser Trp His Ala Ser Val Pro Arg Val Glu Ala Lys Arg Tyr Cys Gin 

420 425 430 

Val Tyr Arg Pro Asp Tyr Ala Arg He Ala Lys Cys Val Tyr Lys Leu 
435 440 445 

Pro Tyr Val Asn Asn Glu Lys Phe Leu Glu Leu Gly Lys Leu Asp Phe 
4^0 455 460 

Asn lie He Gin Ser He His Gin Glu Glu Met Lys Asn Val Thr Ser 
465 470 475 480 

Trp Phe Arg Asp Ser Gly Leu Pro Leu Phe Thr Phe Ala Arg Glu Arg 

485 490 495 

Pro Leu Glu Phe Tyr Phe Leu Val Ala Ala Gly Thr Tyr Glu Pro Gin 

500 505 510 

Tyr Ala Lys Cys Arg Phe Leu Phe Thr Lys Val Ala Cys Leu Gin Thr 
515 520 525 

Val Leu Asp Asp Met Tyr Asp Thr Tyr Gly Thr Leu Asp Glu Leu Lys 
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530 

Leu Phe Thr Glu 
545 

Leu Pro Asp Tyr 



His Glu Val Ala 

580 

Ser Phe Phe Arg 
595 

Glu Ala Glu Trp 
610 

lie Lys Asn Gly 
625 

Gly Val Leu lie 



Lys Val Asp Tyr 

660 

lie Ser Arg Leu 
675 

Arg Gly Glu Leu 
690 

Glu Cys Thr Glu 
705 

Pro Ala Val Lys 



Pro Phe Ala Cys 

740 

Val lie Phe Lys 
755 

Lys Asp His lie 
770 



535 

Ala Val Arg Arg 
550 

Met Lys Leu Cys 
565 

Trp Glu Ala Glu 



Lys Gly Trp Glu 

600 

Leu Ala Ala Glu 
615 

lie Thr Ser lie 
630 

Met Asp Gly Gin 
645 

Pro Gly Arg Arg 



Ala Asp Asp Thr 

680 

Ala Ser Ser lie 
695 

Glu Glu Ala Leu 
710 

Glu Leu Thr Arg 
725 

Lys Lys Met Leu 



Asp Gly Asp Gly 

760 

Lys Glu Cys Leu 
775 



540 

Trp Asp Leu Ser 
555 

Tyr Gin lie Tyr 
570 

Lys Glu Gin Gly 
585 

Asp Tyr Leu Leu 



Tyr Val Pro Thr 

620 

Gly Gin Arg lie 
635 

Leu Leu Ser Gin 
650 

Val Leu Thr Glu 
665 

Lys Thr Tyr Lys 



Glu Cys Tyr Met 

700 

Asp His lie Tyr 
715 

Glu Phe Leu Lys 
730 

Phe Glu Glu Thr 
745 

Phe Gly Val Ser 



lie Glu Pro Leu 

780 



Phe Thr Glu Asn 

560 

Tyr Asp He Val 
575 

Arg Glu Leu Val 
590 

Gly Tyr Tyr Glu 
605 

Leu Asp Glu Tyr 



Leu Leu Leu Ser 

640 

Glu Ala Leu Glu 
655 

Leu Asn Ser Leu 
67 0 

Ala Glu Lys Ala 
685 

Lys Asp His Pro 



Ser He Leu Glu 

720 

Pro Asp Asp Val 
735 

Arg Val Thr Met 
750 

Lys Leu Glu Val 
765 

Pro Leu 



<210> 17 
<211> 1967 
<212> DNA 

<213> Abies grandis 

<220> 

<221> CDS 

<222> (3) . . (1736) 

<223> Clone AG4.30 

<400> 17 

tt tct gaa tct tec ate cct cga cgc aca ggg aat cat cac gga aat 47 
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Ser Glu Ser Ser He Pro Arg Arg Thr Gly Asn His His Gly Asn 
1 5 io 



15 



gtg tgg gac gat gac etc ata cac tct etc 
Val Trp Asp Asp Asp Leu He 

20 



aac teg ccc tat ggg gca 
His Ser Leu Asn Ser Pro Tyr Gly Ala 

30 



25 



cct gca tat tat gag etc ctt 

Pro Ala Tyr Tyr Glu Leu Leu 

35 

tta ctt ttg act gaa atg gaa 

Leu Leu Leu Thr Glu Met Glu 
50 

aaa cgt ctt cag ate gtt gac 

Lys Arg Leu Gin He Val Asp 

65 70 



cat 
His 
80 



ttt 
Phe 



gaa 
Glu 



ca c 
His 



gaa 
Glu 



ata 
He 
85 



caa 
Gin 



tgg tgg aac gaa aaa ggt ate 

Trp Trp Asn Glu Lys Gly He 

100 

aaa gat etc aac get aca get 

Lys Asp Leu Asn Ala Thr Ala 

115 

cga tat aac gta teg tea ggt 

Arg Tyr Asn Val Ser Ser Gly 
130 



ggg aag 
Gly Lys 
145 



ttc ttc tgc aac ttt 
Phe Phe Cys Asn Phe 

150 



caa gtg aga age atg ttg teg 
Gin Val Arg Ser Met Leu Ser 
160 165 



ccc 
Pro 



gga 
Gly 



gaa aaa gtg atg gaa 
Glu Lys Val Met Glu 
180 



eta aac 
Leu Asn 



caa 
Gin 



gtt 
Val 
195 



tta get gga 
Leu Ala Gly 



age ctt ttg gag aga ggt gaa 

Ser Leu Leu Glu Arg Gly Glu 
210 

agt gtg ccg aga tgg gag gca 

Ser Val Pro Arg Trp Glu Ala 

225 230 



caa aag ctt att gag 
Gin Lys Leu He Glu 
40 



gag 
Glu 



ate 
He 
45 



aag 
Lys 



cat 
His 



atg gat gat ggc gat cat gat tta ate 
Met Asp Asp Gly Asp His Asp Leu He 
55 60 



ggg gag gga tea aga gat tec ttc age 

Gly Glu Gly Ser Arg Asp Ser Phe Ser 

105 no 

tta gga ttt cgc get etc cga ctg cat 

Leu Gly Phe Arg Ala Leu Arg Leu His 
120 125 



act ggt gaa gaa gga aga gga gat aaa 
Thr Gly Glu Glu Gly Arg Gly Asp Lys 

155 



tta ctt cga get tea 
Leu Leu Arg Ala Ser 

170 



gag att teg ttt 
Glu He Ser Phe 

175 



gag gee aag gca ttc aca aga gaa tat 

Glu Ala Lys Ala Phe Thr Arg Glu Tyr 
185 190 

cac ggg gat gtg act gac gtg gat caa 

His Gly Asp Val Thr Asp Val Asp Gin 
200 205 

gta cgc att gga gtt tec atg get tgc 

Val Arg He Gly Val Ser Met Ala Cys 

215 220 

agg age ttt etc gaa ata tat gga cac 

Arg Ser Phe Leu Glu He Tyr Gly His 

235 



95 



143 



191 



act ttg gaa tgc ctg gga ate gat aga 239 

Thr Leu. Glu Cys Leu Gly He Asp Arg 

75 

aca get get tta gat tac gtt tac aga 287 

Thr Ala Ala Leu Asp Tyr Val Tyr Arg 

90 95 



335 



383 



gtg ttg aag aat ttc aag gat gaa aac 431 
Val Leu Lys Asn Phe Lys Asp Glu Asn 
135 140 



479 



527 



575 



623 



671 



719 



aac cat teg tgg etc aag teg aat ate aac caa aaa atg ttg aag tta 



767 



25 



wo 



99/02030 



PCT7US98/14528 



Asn His Ser Trp Leu Lys Ser Asn lie Asn Gin Lys Met Leu Lys Leu 

240 245 250 255 

gcc aaa ttg gac ttc aat att ctg caa tgc aaa cat cac aag gag ata 

Ala Lys Leu Asp Phe Asn He Leu Gin Cys Lys His His Lys Glu He 

260 265 270 

r? 9 ^ tgg aga gaC tCg ggt ata tc 9 ca 9 ct 9 aat 

Gin Phe He Thr Arg Trp Trp Arg Asp Ser Gly He Ser Gin Leu Asn 

275 280 285 

ttc tat cga aag cga cac gtg gaa tat tat tct tgg gtt gtt atg tgc 

Phe Tyr Arg Lys Arg His Val Glu Tyr Tyr Ser Trp Val Val Met Cys 

290 295 300 

It* Hf 9 CCa gag ttC tCt gaa agt aga att 9 CC ttc gcc aaa act 

He Phe Glu Pro Glu Phe Ser Glu Ser Arg He Ala Phe Ala Lys Thr 

305 310 315 

get ate eta tgt act gtt eta gat gac etc tat gat acg cac gca acq 

Ala He Leu Cys Thr Val Leu Asp Asp Leu Tyr Asp Thr His Ala Thr 

320 325 330 



335 



ttg cat gaa ate aaa ate atg aca gag gga gtg aga cga tgg gat ctt 

Leu His Glu He Lys He Met Thr Glu Gly Val Arg Arg Trp Asp Leu 

340 345 350 

teg ttg aca gat gac etc cca gac tac att aaa att gca ttc cag ttc 

ber Leu Thr Asp Asp Leu Pro Asp Tyr He Lys He Ala Phe Gin Phe 

355 360 365 

Ph~ pk C ^ a 9t ? gaa ttg ata gtt gaa atc ^ aaa c 99 caa 

Phe Phe Asn Thr Val Asn Glu Leu He Val Glu He Val Lys Arg Gin 

370 375 380 

ggg egg gat atg aca acc ata gtt aaa gat tgc tgg aag cga tac att 

Gly Arg Asp Met Thr Thr He Val Lys Asp Cys Trp Lys Arg Tyr He 

385 390 395 



gag tct tat ctg caa gaa gcg gaa tgg ata gca act gga cat att ccc 
Glu Ser Tyr Leu Gin Glu Ala Glu Trp He Ala Thr Gly His He Pro 
400 405 410 415 



^ tl* 9 ? a taC ata aag aaC ggc atg 9Ct age tea ggg atg tgt 

Thr Phe Asn Glu Tyr He Lys Asn Gly Met Ala Ser Ser Gly Met Cys 

420 425 430 

att gta aat ttg aat cca ctt etc ttg ttg ggt aaa ctt etc ccc gac 

He Val Asn Leu Asn Pro Leu Leu Leu Leu Gly Lys Leu Leu Pro Asp 

43 $ 440 445 

aac att ctg gag caa ata cat tct cca tec aag atc ctg gac etc tta 

Asn He Leu Glu Gin He His Ser Pro Ser Lys He Leu Asp Leu Leu 

450 455 460 



gaa ttg acg ggc aga atc gcc gat gac tta aaa gat ttc gag gac gag 

Glu Leu Thr Gly Arg He Ala Asp Asp Leu Lys Asp Phe Glu Asp Glu 

465 470 475 

aag gaa cgc ggg gag atg get tea tct tta cag tgt tat atg aaa gaa 



26 



815 



863 



911 



959 



1007 



1055 



1103 



1151 



1199 



1247 



1295 



1343 



1391 



1439 



1487 
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c r ' , " ~ ^ ^ L aci y " d ace ttc aat ata ggt cqa qqa 

530 Cys cys Lys Lys phe Thr phe Asn Ile **y A ^ GlJ 

5 ^ <5 

7^ rT- L tC aaa tac aga gac 59C tta tac att tct gac aag 

Leu Gin Phe lie Tyr Lys Tyr Arg Asp Gly Leu Tyr lie Ser Asp Lys 

d4o 550 



Lys Glu Arg Gly Glu Met Ala Ser Ser Leu Gin Cys Tyr Met Lys Glu 

485 490 495 

aat cct gaa tct aca gtg gaa aat get tta aat cac ata aaa ggc ate 1535 
Asn Pro Glu Ser Thr Val Glu Asn Ala Leu Asn His Ile Lys Gly lie 

500 505 510 

ctt aat cgt tec ett gag gaa ttt aat tgg gag ttt atg aag cag gat 1583 
Leu Asn Arg Ser Leu Glu Glu Phe Asn Trp Glu Phe Met Lys Gin Asp 

515 520 525 

Tr p" ^ ^ f 33 H tC ttc aat "a ggt ega gga 1631 

lie 

540 

Leu Gin £f,° ?*! *f.? * 9 * ^ ?? C tta ^ C »" aag 1679 

Tyr 

555 

gaa gta aag gac eag ata ttc aaa att eta gtc cac caa gtt cca atg 1727 

Glu Val Lys Asp Gin lie Phe Lys lie Leu Val His Gin Val Pro Met 

565 570 575 

Glu Glu tgatggtctt ggttgtagtt gtctattatg gtatattgea i 776 

ttgacattta tgcttaaagg tgtttcttaa acgtttaggg eggacegtta aataagttgg 1836 
caataattaa tatttagaga ctttgtggaa gtgtttaggg cataaaattg cctatggcct 1896 
atggcaagct acaaattgaa attgttgtgt ttataatatt tttattttat ttaaaaaaaa 1956 
aaaaaaaaaa a 

1967 

<210> 18 
<211> 577 
<212> PRT 

<213> Abies grandis 



<400> 18 
Ser Glu Ser Ser 
1 



Trp Asp Asp Asp 

20 

Ala Tyr Tyr Glu 
35 



Leu Leu Thr Glu 
50 



Arg Leu Gin lie 
65 



Phe Glu His Glu 



Trp Asn Glu Lys 



Ile Pro Arg Arg 
5 



Leu Ile His Ser 



Leu Leu Gin Lys 

40 



Met Glu Met Asp 
55 



Val Asp Thr Leu 

70 



He Gin Thr Ala 
85 



Gly He Gly Glu 



Thr Gly Asn His 
10 

Leu Asn Ser Pro 

25 

Leu Ile Glu Glu 



Asp Gly Asp His 

60 

Glu Cys Leu Gly 
75 

Ala Leu Asp Tyr 
90 

Gly Ser Arg Asp 



His Gly Asn Val 
15 

Tyr Gly Ala Pro 

30 

Ile Lys His Leu 
45 

Asp Leu Ile Lys 



Ile Asp Arg His 

80 

Val Tyr Arg Trp 
95 

Ser Phe Ser Lys 



27 



YVO 99/02030 



PCT7US98/14528 



100 

Asp Leu Asn Ala 
115 

Tyr Asn Val Ser 
130 

Lys Phe Phe Cys 
145 

Val Arg Ser Met 



Gly Glu Lys Val 

180 

Asn Gin Val Leu 
195 

Leu Leu Glu Arg 
210 

Val Pro Arg Trp 

225 

His Ser Trp Leu 



Lys Leu Asp Phe 

260 

Phe He Thr Arg 
275 

Tyr Arg Lys Arg 
290 



Phe Glu Pro Glu 
305 

He Leu Cys Thr 



His Glu He Lys 

340 

Leu Thr Asp Asp 
355 

Phe Asn Thr Val 
370 

Arg Asp Met Thr 
385 

Ser Tyr Leu Gin 



Phe Asn Glu Tyr 



Thr Ala Leu Gly 

120 

Ser Gly Val Leu 
135 

Asn Phe Thr Gly 
150 

Leu Ser Leu Leu 
165 

Met Glu Glu Ala 



Ala Gly His Gly 

200 

Gly Glu Val Arg 
215 

Glu Ala Arg Ser 
230 

Lys Ser Asn lie 
245 

Asn lie Leu Gin 



Trp Trp Arg Asp 

280 

His Val Glu Tyr 
295 

Phe Ser Glu Ser 
310 

Val Leu Asp Asp 

325 

lie Met Thr Glu 



Leu Pro Asp Tyr 

360 

Asn Glu Leu He 
375 

Thr He Val Lys 
390 

Glu Ala Glu Trp 
405 

He Lys Asn Gly 



105 

Phe Arg Ala Leu 



Lys Asn Phe Lys 

140 

Glu Glu Gly Arg 
155 

Arg Ala Ser Glu 
170 

Lys Ala Phe Thr 
185 

Asp Val Thr Asp 



He Gly Val Ser 

220 

Phe Leu Glu He 
235 

Asn Gin Lys Met 
250 

Cys Lys His His 
265 

Ser Gly He Ser 



Tyr Ser Trp Val 

300 

Arg He Ala Phe 
315 

Leu Tyr Asp Thr 
330 

Gly Val Arg Arg 
345 

He Lys He Ala 



Val Glu He Val 

380 

Asp Cys Trp Lys 
395 

He Ala Thr Gly 
410 

Met Ala Ser Ser 



110 

Arg Leu His Arg 
125 

Asp Glu Asn Gly 



Gly Asp Lys Gin 

160 

He Ser Phe Pro 
175 

Arg Glu Tyr Leu 
190 

Val Asp Gin Ser 
205 

Met Ala Cys Ser 



Tyr Gly His Asn 

240 

Leu Lys Leu Ala 
255 

Lys Glu He Gin 
270 

Gin Leu Asn Phe 
285 

Val Met Cys He 



Ala Lys Thr Ala 

320 

His Ala Thr Leu 

335 

Trp Asp Leu Ser 
350 

Phe Gin Phe Phe 
365 

Lys Arg Gin Gly 



Arg Tyr He Glu 

400 

His He Pro Thr 
415 

Gly Met Cys He 
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420 

Val Asn Leu Asn 
435 

lie Leu Glu Gin 
450 

Leu Thr Gly Arg 
465 

Glu Arg Gly Glu 



Pro Glu Ser Thr 

500 

Asn Arg Ser Leu 
515 

Val Pro Met Cys 
530 

Gin Phe He Tyr 
545 

Val Lys Asp Gin 



Glu 



Pro Leu Leu Leu 

440 

He His Ser Pro 
455 

He Ala Asp Asp 
470 

Met Ala Ser Ser 
485 

Val Glu Asn Ala 



Glu Glu Phe Asn 

520 

Cys Lys Lys Phe 
535 

Lys Tyr Arg Asp 
550 

He Phe Lys lie 
565 



425 

Leu Gly Lys Leu 



Ser Lys He Leu 

460 

Leu Lys Asp Phe 
475 

Leu Gin Cys Tyr 
490 

Leu Asn His He 
505 

Trp Glu Phe Met 



Thr Phe Asn He 

540 

Gly Leu Tyr lie 
555 

Leu Val His Gin 
570 



430 

Leu Pro Asp Asn 
445 

Asp Leu Leu Glu 



Glu Asp Glu Lys 

480 

Met Lys Glu Asn 
495 

Lys Gly He Leu 
510 



Lys Gin Asp Ser 
525 

Gly Arg Gly Leu 



Ser Asp Lys Glu 

560 

Val Pro Met Glu 
575 



<210> 19 

<211> 1416 

<212> DNA 

<213> Abies grandis 

<220> 

<221> CDS 

<222> (3) . . (1199) 

<223> Clone AGS. 9 

<400> 19 

aa aaa gtg atg gaa gag gcg aag gca ttc aca aca aat tat eta aag 47 

Lys Val Met Glu Glu Ala Lys Ala Phe Thr Thr Asn Tyr Leu Lys 
1 5 10 15 

aaa gtt tta gca gga egg gag get acc cac gtc gat gaa age ctt ttg 95 
Lys Val Leu Ala Gly Arg Glu Ala Thr His Val Asp Glu Ser Leu Leu 

20 25 30 

gga gag gtg aag tac gca ttg gag ttt cca tgg cat tgc agt gtg cag 143 
Gly Glu Val Lys Tyr Ala Leu Glu Phe Pro Trp His Cys Ser Val Gin 

35 40 45 

aga tgg gag gca agg age ttt ate gaa ata ttt gga caa att gat tea 191 
Arg Trp Glu Ala Arg Ser Phe He Glu He Phe Gly Gin He Asp Ser 
50 55 60 

gag ctt aag teg aat ttg age aaa aaa atg tta gag ttg gcg aaa ttg 239 



29 



WO 99/02030 



PCT/US98/14528 



Glu Leu Lys Ser Asn Leu Ser Lys Lys Met Leu Glu Leu Ala Lys Leu 

65 "70 75 

gac ttc aat att ctg caa tgc aca cat cag aaa gaa ctg cag att ate 

Asp Phe Asn He Leu Gin Cys Thr His Gin Lys Glu Leu Gin He He 

85 90 95 

tea agg tgg ttc gca gac tea agt ata gca tec ctg aat ttc tat cqq 

Ser Arg Trp Phe Ala Asp Ser Ser He Ala Ser Leu Asn Phe Tyr Arq 

100 105 110 

Lvs Cvs Jvr 11* l AC fctt t9g atg 9Ct 9 " 9" atc tc <= gag 

Lys Cys Tyr Val Glu Phe Tyr Phe Trp Met Ala Ala Ala lie Ser Glu 

115 120 125 

IV* tCt 993 agC aga gtt ? cc ttc aca "a att get ata ctq 

Pro Glu Phe Ser Gly Ser Arg Val Ala Phe Thr Lys He Ala He Leu 

130 135 140 

Met Thr t"* 9 * C Ctg taC 9Bt 3Ct CaC " a acc "9 3 ac caa 

Met Thr Met Leu Asp Asp Leu Tyr Asp Thr His Gly Thr Leu Asp Gin 

145 150 155 



etc aaa ate ttt aca gag gga gt g aga cga tgg gat gtt teg ttg qta 

Leu Lys lie Phe Thr Glu Gly Val Arg Arg Trp Asp Val Ser Leu vll 

165 no 175 

gag ggc etc cca gac ttc atg aaa att gca ttc gag ttc tgg tta aag 

Glu Gly Leu Pro Asp Phe Met Lys He Ala Phe Glu Phe Trp Leu lys 

180 185 lgQ 

aca tct aat gaa ttg att get gaa get gtt aaa gcg caa ggg caa gat 

Thr Ser Asn Glu Leu He Ala Glu Ala Val Lys Ala Gin Gly Gin Asp 

195 200 205 

a?! 1?° ^ ^ aat gCa tgg 9ag cga tac ctt gaa get 

Met Ala Ala Tyr He Arg Lys Asn Ala Trp Glu Arg Tyr Leu Glu Ala 

210 215 220 



£vr t~?> f ?? 9 9 f a t9g ata 9CC aCt q ^a cat gtc ccc acc ttt 

Tyr Leu Gin Asp Ala Glu Trp He Ala Thr Gly His Val Pro Thr Phe 

225 230 235 

Aso III Tvr 9 ? C aCa CCa aaC 3Ct g ^ atg tgt gta ttg 

Asp Glu Tyr Leu Asn Asn Gly Thr Pro Asn Thr Gly Met Cys Val Leu 

245 250 255 

a™ T t9 S Cg Ctt Ctg tta atg ggt cat tta cca atc gac att 

Asn Leu He Pro Leu Leu Leu Met Gly Glu His Leu Pro He Asp He 

260 265 270 

ctg gag caa ata ttc ttg ccc tec agg ttc cac cat etc att gaa ttg 

Leu Glu Gin He Phe Leu Pro Ser Arg Phe His His Leu He Glu Leu 

275 280 285 

get tec agg etc gtc gat gac gcg aga gat ttc cag gcg gag aag gat 

Ala Ser Arg Leu Val Asp Asp Ala Arg Asp Phe Gin Ala Glu Lys Asp 

290 295 300 

cat ggg gat tta teg tgt att gag tgt tat tta aaa gat cat cct gag 



287 



335 



383 



431 



479 



527 



575 



623 



671 



719 



767 



815 



863 



911 



959 
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His Gly Asp Leu Ser Cys He Glu Cys Tyr Leu Lys Asp His Pro Glu 
305 310 315 

^ Ct ^ Hf* 9at gCt " a aat Cat gtt aat ggc ctc ctt ^ c aat 1007 

Ser Thr Val Glu Asp Ala Leu Asn His Val Asn Gly Leu Leu Gly Asn 

320 325 330 Y 335 

tgc ctt ctg gaa atg aat tgg aag ttc tta aag aag cag gac agt gtg 1055 
Cys Leu Leu Glu Met Asn Trp Lys Phe Leu Lys Lys Gin Asp Ser Val 

340 345 350 

cca etc teg tgt aag aag tac age ttc cat gta ttg gca cga age ate 1103 
Pro Leu Ser Cys Lys Lys Tyr Ser Phe His Val Leu Ala Arg Ser He 

355 360 365 

° aa 9gC gat ggC ttc tcc att tc 9 aaa gtg 1151 

Gin Phe Met Tyr Asn Gin Gly Asp Gly Phe Ser He Ser Asn Lys Val 

370 375 380 

ate aag gat caa gtg cag aaa gtt ctt att gtc ccc gtg cct att tqa 1199 
He Lys Asp Gin Val Gin Lys Val Leu He Val Pro Val Pro He 
385 390 395 

tagtagatac tagatagtag attagtagct attagtattt atttcatatc aatatttact 1259 

aatgctgatg atggttaaag tccattcaga ccaatctttg gtttattgga cttaaataaa 1319 

tgaattaatt agtttgtttt aaaattgtac tatttactgt tggaaataat gttttcatta 1379 

ttgaaataac tagcacaact attttagtgt ggttgat 1416 

<210> 20 
<211> 398 
<212> PRT 

<213> Abies grandis 
<400> 20 

Lys Val Met Glu Glu Ala Lys Ala Phe Thr Thr Asn Tyr Leu Lys Lys 
1 5 io 15 

Val Leu Ala Gly Arg Glu Ala Thr His Val Asp Glu Ser Leu Leu Glv 

20 25 30 

Glu Val Lys Tyr Ala Leu Glu Phe Pro Trp His Cys Ser Val Gin Arg 
35 40 45 

Trp Glu Ala Arg Ser Phe He Glu He Phe Gly Gin He Asp Ser Glu 
50 55 60 

Leu Lys Ser Asn Leu Ser Lys Lys Met Leu Glu Leu Ala Lys Leu Asp 
65 7 0 75 80 

Phe Asn He Leu Gin Cys Thr His Gin Lys Glu Leu Gin He He Ser 

85 90 95 

Arg Trp Phe Ala Asp Ser Ser He Ala Ser Leu Asn Phe Tyr Arg Lys 

100 105 no 



Cys Tyr Val Glu Phe Tyr Phe Trp Met Ala Ala Ala II 



e Ser Glu Pro 
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115 120 125 

Glu Phe Ser Gly Ser Arg Val Ala Phe Thr Lys He Ala He Leu Met 
130 135 140 

Thr Met Leu Asp Asp Leu Tyr Asp Thr His Gly Thr Leu Asp Gin Leu 
145 150 155 160 

Lys He Phe Thr Glu Gly Val Arg Arg Trp Asp Val Ser Leu Val Glu 

165 170 175 

Gly Leu Pro Asp Phe Met Lys He Ala Phe Glu Phe Trp Leu Lys Thr 

18 ° 185 190 

Ser Asn Glu Leu He Ala Glu Ala Val Lys Ala Gin Gly Gin Asp Met 
195 200 205 

Ala Ala Tyr He Arg Lys Asn Ala Trp Glu Arg Tyr Leu Glu Ala Tyr 
210 215 220 

Leu Gin A*p Ala Glu Trp He Ala Thr Gly His Val Pro Thr Phe Asp 
225 230 235 240 

Glu Tyr Leu Asn Asn Gly Thr Pro Asn Thr Gly Met Cys Val Leu Asn 

245 250 255 

Leu He Pro Leu Leu Leu Met Gly Glu His Leu Pro He Asp He Leu 

260 265 270 

Glu Gin He Phe Leu Pro Ser Arg Phe His His Leu He Glu Leu Ala 
275 280 285 

Ser Arg Leu Val Asp Asp Ala Arg Asp Phe Gin Ala Glu Lys Asp His 
290 295 300 

Gly Asp Leu Ser Cys He Glu Cys Tyr Leu Lys Asp His Pro Glu Ser 
305 3 10 315 320 

Thr Val Glu Asp Ala Leu Asn His Val Asn Gly Leu Leu Gly Asn Cys 

325 330 335 

Leu Leu Glu Met Asn Trp Lys Phe Leu Lys Lys Gin Asp Ser Val Pro 

340 345 350 

Leu Ser Cys Lys Lys Tyr Ser Phe His Val Leu Ala Arg Ser He Gin 
355 360 365 

Phe Met Tyr Asn Gin Gly Asp Gly Phe Ser He Ser Asn Lys Val He 
370 375 380 

Lys Asp Gin Val Gin Lys Val Leu He Val Pro Val Pro He 
385 390 395 



<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: 

oligonucleotide PCR primer E wherein the letter 
"n" represents an inosine residue 

<220> 

<221> misc feature 
<222> (1) .7(23) 

<223> PCR primer E wherein the letter n represents 
inosine 

<400> 21 

ggngaramrr tnatggarga rgc 



<210> 22 
24 
DNA 

Artificial 



<211> 
<212> 
<213> 



Sequence 



<220> 
<223> 



Description of Artificial Sequence: degenerate 
oligonucleotide primer F wherein the letter 
represents an inosine residue 



"n M 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(1) . . (24) 
PCR primer F 
inosine 



wherein the letter n represents 



<400> 22 

garytneary tnhbnmgntg gtgg 



24 



<210> 23 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 

oligonucleotide PCR primer G wherein the letter 
"n" represents an inosine residue 

<220> 

<221> misc feature 
<222> (1).T(21) 

<223> PCR primer G wherein the letter n represents 
inosine 

<400> 23 

ccarttnarn ccyttnacrt c 



<210> 24 
<211> 533 
<212> DNA 

<213> Abies grandis 
<400> 24 
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ggggaaaaaa tgatggagga agctgaaatc ttctctacca aatatttaaa agaagccctg 60 
caaaagattc cggtctccag tctttcgcga gagatcgggg acgttttgga atatggttgg 120 
cacacatatt tgccgcgatt ggaagcaagg aattacatcc aagtctttgg acaggacact 180 
gagaacacga agtcatatgt gaagagcaaa aaacttttag aactcgcaaa attggagttc 240 
aacatctttc aatccttact cgcatatccg cattgcaacc cattctgaca atggacatcc 300 
cctttcctga tcatatcctc aaggaagttg acttcccatc aaagcttaac gacttggcat 360 
gtgccatcct tcgattacga ggtgatacgc ggtgctacaa ggcggacagg gctcgtggag 420 
aagaagcttc ctctatatca tgttatatga aagacaatcc tggagtatca gaggaagatg 480 
ctctcgatca tatcaacgcc atgatcagtg acgaagtcaa aggcttcaat tgg 533 

<210> 25 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: conserved 

amino acid motif on which the sequence of Primer D 

was baaed, wherein Xaa at position number 3 represents Thr or lie Xaa at 
position number 4 represents lie or Tyr or Phe, Xaa at position number 6 
represents Ala or Val and Xaa at position number 8 represents Ala or Gly 

<220> 

<221> SITE 
<222> { 1) . . (8) 

<223> conserved amino acid motif on which sequence of 
primer D was based 

<400> 25 

Asp Asp Xaa Xaa Asp Xaa Tyr Xaa 
1 5 

<210> 26 
<211> 8 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: conserved 

amino acid motif on which the sequence of Primer E 

was based wherein Xaa at position 3 represents Lys or Thr, Xaa at position 
4 represents Val or lie, Xaa at position 6 represents Glu or Asp 

<220> 

<221> SITE 
<222> (1) . . (8) 

<223> conserved amino acid sequence on which the 
sequence of primer E was based 

<400> 26 
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Gly Glu Xaa Xaa Met Xaa Glu Ala 
1 5 

<210> 27 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: conserved 
amino acid sequence on which the sequence of 

primer F was based wherein Xaa at position 2 represents Phe or Tyr or Asr> 
or ^ g P ° 31tl0n 3 "presents He or Leu, Xaa at position 4 represents Tnr or Leu 

<220> 

<221> SITE 
<222> (1) . . (7) 

<223> conserved amino acid sequence on which the 
sequence of primer F was based 

<400> 27 

Gin Xaa Xaa Xaa Arg Trp Trp 
1 5 

<210> 28 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: conserved 

amino acid motif on which the sequence of primer G 

was based wherein Xaa at position 6 represents Phe or Leu 

<220> 

<221> SITE 
<222> (1) . . (8) 

<223> conserved amino acid sequence on which the 
sequence of primer G was based 

<400> 28 

Asp Val He Lys Gly Xaa Asn Trp 
1 5 

<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: T3 primer 
oligonucleotide sequence 

<400> 29 

aattaaccct cactaaaggg 20 
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<210> 30 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: T7 
oligonucleotide primer sequence 

<400> 30 

gtaatacgac tcactatagg gc 



<210> 31 
<211> 2205 
<212> DNA 

<213> Abies grandis 

<220> 

<221> CDS 

<222> (57) . . (1943) 

<223> Clone AG3.48 

<400> 31 

gttatcttga gcttcctcca tataggccaa cacatatcat atcaaaggga gcaaga atg 59 

Met 
1 



get ctg gtt tct ate tea ccg ttg get teg aaa tct tgc ctg cgc aag 

Ala Leu Val Ser He Ser Pro Leu Ala Ser Lys Ser Cys Leu Arg Lys 

5 10 is 

teg ttg ate agt tea att cat gaa cat aag cct ccc tat aga aca ate 

ber Leu He Ser Ser He His Glu His Lys Pro Pro Tyr Arg Thr He 

20 25 30 

cca aat ctt gga atg cgt agg cga ggg aaa tct gtc acg cct tec atq 

Pro Asn Leu Gly Met Arg Arg Arg Gly Lys Ser Val Thr Pro Ser Met 

35 40 45 

age ate agt ttg gee ace get gca cct gat gat ggt gta caa aga cgc 

Ser He Ser Leu Ala Thr Ala Ala Pro Asp Asp Gly Val Gin Arg Arg 

3U 55 60 6 5 

ata ggt gac tac cat tec aat ate tgg gac gat gat ttc ata cag tct 

He Gly Asp Tyr His Ser Asn He Trp Asp Asp Asp Phe He Gin Ser 

70 75 eo 

eta tea acg cat tat ggg gaa ccc tct tac cag gaa cgt get gag aga 

Leu Ser Thr His Tyr Gly Glu Pro Ser Tyr Gin Glu Arg Ala Glu Arc 

85 9 0 95 

tta att gtg gag gta aag aag ata ttc aat tea atg tac ctg gat gat 

Leu He Val Glu Val Lys Lys He Phe Asn Ser Met Tyr Leu Asp Asp 

100 105 no 



gga aga tta atg agt tec ttt aat gat etc atg caa cgc ctt tgg ata 
Gly Arg Leu Met Ser Ser Phe Asn Asp Leu Met Gin Arg Leu Trp He 
115 120 125 



107 



155 



203 



251 



299 



347 



395 



443 
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gtc gat age gtt gaa cgt ttg ggg ata get aga cat ttc aag aac gag 491 

Val Asp Ser Val Glu Arg Leu Gly lie Ala Arg His Phe Lys Asn Glu 

130 135 140 145 

ata aca tea get ctg gat tat gtt ttc cgt tac tgg gag gaa aac ggc 539 

lie Thr Ser Ala Leu Asp Tyr Val Phe Arg Tyr Trp Glu Glu Asn Gly 

150 155 160 

att gga tgt ggg aga gac agt att gtt act gat etc aac tea act gcg 587 

lie Gly Cys Gly Arg Asp Ser He Val Thr Asp Leu Asn Ser Thr Ala 

165 170 175 

ttg ggg ttt cga act ctt cga tta cac ggg tac act gta tct cca gag 635 

Leu Gly Phe Arg Thr Leu Arg Leu His Gly Tyr Thr Val Ser Pro Glu 

180 185 190 

gtt tta aaa get ttt caa gat caa aat gga cag ttt gta tgc tec ccc 683 

Val Leu Lys Ala Phe Gin Asp Gin Asn Gly Gin Phe Val Cys Ser Pro 

195 200 205 

ggt cag aca gag ggt gag ate aga age gtt ctt aac tta tat egg get 731 

Gly Gin Thr Glu Gly Glu He Arg Ser Val Leu Asn Leu Tyr Arq Ala 

210 215 220 2 25 

tec etc att gec ttc cct ggt gag aaa gtt atg gaa gaa get gaa ate 779 

Ser Leu He Ala Phe Pro Gly Glu Lys Val Met Glu Glu Ala Glu He 

230 235 240 

ttc tec aca aga tat ttg aaa gaa get eta caa aag att cca gtc tec 827 

Phe Ser Thr Arg Tyr Leu Lys Glu Ala Leu Gin Lys He Pro Val Ser 

245 250 255 

get ctt tea caa gag ata aag ttt gtt atg gaa tat ggc tgg cac aca 875 

Ala Leu Ser Gin Glu He Lys Phe Val Met Glu Tyr Gly Trp His Thr 

260 265 270 

aat ttg cca aga ttg gaa gca aga aat tac ata gac aca ctt gag aaa 923 

Asn Leu Pro Arg Leu Glu Ala Arg Asn Tyr He Asp Thr Leu Glu Lys 

275 280 285 

gac ace agt gca tgg etc aat aaa aat get ggg aag aag ctt tta gaa 971 

Asp Thr Ser Ala Trp Leu Asn Lys Asn Ala Gly Lys Lys Leu Leu Glu 

290 295 300 305 

ctt gca aaa ttg gag ttc aat ata ttt aac tec tta caa caa aag gaa 1019 

Leu Ala Lys Leu Glu Phe Asn He Phe Asn Ser Leu Gin Gin Lys Glu 

310 315 320 

tta caa tat ctt ttg aga tgg tgg aaa gag teg gat ttg cct aaa ttg 1067 

Leu Gin Tyr Leu Leu Arg Trp Trp Lys Glu Ser Asp Leu Pro Lys Leu 

325 330 335 

aca ttt get egg cat cgt cat gtg gaa ttc tac act ttg gec tct tgt 1115 

Thr Phe Ala Arg His Arg His Val Glu Phe Tyr Thr Leu Ala Ser Cys 

340 345 35 0 

att gec att gac cca aaa cat tct gca ttc aga eta ggc ttc gec aaa 1163 

He Ala He Asp Pro Lys His Ser Ala Phe Arg Leu Gly Phe Ala Lys 

355 360 365 
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atg tgt cat ctt gtc aca gtt ttg gac gat att tac gac act ttt gga 1211 

Met Cys His Leu Val Thr Val Leu Asp Asp He Tyr Asp Thr Phe Glv 

370 37 * 380 385 

Thr r° T Ctt 2?* CtC ttC tCt 9Ca att aag a ^ ^ aat 1259 

Thr He Asp Glu Leu Glu Leu Phe Thr Ser Ala He Lys Arg Trp Asn 

390 395 400 

tea tea gag ata gaa cac ctt cca gaa tat atg aaa tgt gtg tac atg 1307 

Ser Ser Glu He Glu His Leu Pro Glu Tyr Met Lys Cys Val Tyr Met 

405 410 415 

If tf ° tg aCa Cga gag gCg Sag aa 9 act 

Val Val Phe Glu Thr Val Asn Glu Leu Thr Arg Glu Ala Glu Lys Thr 

420 425 430 

caa ggg aga aac act etc aac tat gtt cga aag get tgg gag get tat 14 03 

Gin Gly Arg Asn Thr Leu Asn Tyr Val Arg Lys Ala Trp Glu Ala Tyr 

435 440 445 

^ gat *; Ca tat at S gaa gaa gca aaa tgg ate tct aat ggt tat ctg 1451 

Phe Asp Ser Tyr Met Glu Glu Ala Lys Trp He Ser Asn Gly Tyr Leu 

4b0 4 55 460 4 65 

cca acg ttt gaa gag tac cat gag aat ggg aaa gtg age tct gca tat 1499 

Pro Thr Phe Glu Glu Tyr His Glu Asn Gly Lys Val Ser Ser Ala Tyr 

470 4 75 480 

cgc gta gca aca ttg caa ccc ate etc act ttg aat gca tgg ctt cct 1547 

Arg Val Ala Thr Leu Gin Pro He Leu Thr Leu Asn Ala Trp Leu Pro 

4 85 490 495 

gat tac ate ttg aag gga att gat ttt cca tec agg ttc aat gat ttg 

Asp Tyr He Leu Lys Gly He Asp Phe Pro Ser Arg Phe Asn Asp Leu 

500 505 510 

gca teg tec ttc ctt egg eta cga ggt gac aca cgc tgc tac aag gec 1643 

Ala Ser Ser Phe Leu Arg Leu Arg Gly Asp Thr Arg Cys Tyr Lys Ala 

515 520 525 

gat agg gat cgt ggt gaa gaa get teg tgt ata tea tgt tat atg aaa 1691 

Asp Arg Asp Arg Gly Glu Glu Ala Ser Cys He Ser Cys Tyr Met Lvs 

530 535 540 545 

gac aat cct gga tea acc gaa gaa gat gee etc aat cat ate aat gee 1739 

Asp Asn Pro Gly Ser Thr Glu Glu Asp Ala Leu Asn His He Asn Ala 

550 555 5 6 o 

atg gtc aat gac ata ate aaa gaa tta aat tgg gaa ctt eta aga tec 1787 

Met Val Asn Asp He He Lys Glu Leu Asn Trp Glu Leu Leu Arg Ser 

565 570 575 

aac gac aat att cca atg ctg gec aag aaa cat get ttt gac ata aca 1835 

Asn Asp Asn He Pro Met Leu Ala Lys Lys His Ala Phe Asp He Thr 

580 585 590 

aga get etc cac cat etc tac ata tat cga gat ggc ttt agt gtt gec 1883 

Arg Ala Leu His His Leu Tyr He Tyr Arg Asp Gly Phe Ser Val Ala 

595 600 605 
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aac aag gaa aca aaa aaa ttg gtt atg gaa aca etc ctt gaa tct ate 1931 
Asn Lys Glu Thr Ly S Lys Leu Val Met Glu Thr Leu Leu Glu Ser Met 

^3 

Leu P^e taaCCatatC c "aataata agctcataat gctaaattat 1983 

tggccttatg acatagttta tgtatgtact tgtgtgaatt caatcatatc gtgtgggtat 2043 

gattaaaaag ctagagctta ctaggttagt: aacatggtga taaaagttat aaaatgtgag 2103 

ttatagagat acccatgttg aataatgaat tacaaaaaga gaaatttatg tagaataaga 2163 

2205 



ttggaagctt ttcaattgtt ttaaaaaaaa aaaaaaaaaa aa 



<210> 32 
<211> 627 
<212> PRT 

<213> Abies grandis 
<400> 32 

Met Ala Leu Val Ser He Ser Pro Leu Ala Ser Lys Ser Cys Leu Arq 

5 10 15 

Lys Ser Leu He Ser Ser He His Glu His Lys Pro Pro Tyr Arq Thr 

20 25 30 

He Pro Asn Leu Gly Met Arg Arg Arg Gly Lys Ser Val Thr Pro Ser 
35 40 45 

Met Ser He Ser Leu Ala Thr Ala Ala Pro Asp Asp Gly Val Gin Arg 

50 55 eo 

Arg He Gly Asp Tyr His Ser Asn He Trp Asp Asp Asp Phe He Gin 
" 70 75 80 

Ser Leu Ser Thr His Tyr Gly Glu Pro Ser Tyr Gin Glu Arg Ala Glu 

85 90 95 

Arg Leu He Val Glu Val Lys Lys He Phe Asn Ser Met Tyr Leu Asp 

100 105 110 

Asp Gly Arg Leu Met Ser Ser Phe Asn Asp Leu Met Gin Arg Leu Trp 
115 120 125 

He Val Asp Ser Val Glu Arg Leu Gly He Ala Arg His Phe Lys Asn 
1JU 135 140 

Glu He Thr Ser Ala Leu Asp Tyr Val Phe Arg Tyr Trp Glu Glu Asn 
4 150 155 160 

Gly He Gly Cys Gly Arg Asp Ser He Val Thr Asp Leu Asn Ser Thr 

165 170 i 75 

Ala Leu Gly Phe Arg Thr Leu Arg Leu His Gly Tyr Thr Val Ser Pro 

180 185 190 

Glu Val Leu Lys Ala Phe Gin Asp Gin Asn Gly Gin Phe Val Cys Ser 
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195 

Pro Gly Gin Thr 
210 

Ala Ser Leu lie 
225 

He Phe Ser Thr 



Ser Ala Leu Ser 

260 

Thr Asn Leu Pro 
275 

Lys Asp Thr Ser 
290 

Glu Leu Ala Lys 
305 

Glu Leu Gin Tyr 



Leu Thr Phe Ala 

340 

Cys He Ala lie 
355 

Lys Met Cys His 
370 

Gly Thr He Asp 
385 

Asn Ser Ser Glu 



Met Val Val Phe 

420 

Thr Gin Gly Arg 
435 

Tyr Phe Asp Ser 
450 

Leu Pro Thr Phe 
465 

Tyr Arg Val Ala 



Pro Asp Tyr lie 

500 

Leu Ala Ser Ser 



Glu Gly 

Ala Phe 
230 

Arg Tyr 
245 

Gin Glu 

Arg Leu 

Ala Trp 



Leu Glu 
310 

Leu Leu 
325 

Arg His 



Asp Pro 



Leu Val 

Glu Leu 
390 

He Glu 
405 

Glu Thr 

Asn Thr 

Tyr Met 

Glu Glu 
470 

Thr Leu 
485 

Leu Lys 

Phe Leu 



200 

Glu He 
215 

Pro Gly 

Leu Lys 

He Lys 

Glu Ala 
280 

Leu Asn 
295 

Phe Asn 
Arg Trp 



Arg His 



Lys His 
360 

Thr Val 
375 

Glu Leu 



His Leu 



Val Asn 

Leu Asn 
440 

Glu Glu 
455 

Tyr His 

Gin Pro 

Gly He 

Arg Leu 



Arg Ser 

Glu Lys 

Glu Ala 
250 

Phe Val 
265 

Arg Asn 

Lys Asn 

He Phe 

Trp Lys 
330 

Val Glu 
345 

Ser Ala 

Leu Asp 

Phe Thr 

Pro Glu 
410 

Glu Leu 
425 

Tyr Val 

Ala Lys 

Glu Asn 

He Leu 
490 

Asp Phe 
505 

Arg Gly 



Val Leu 
220 

Val Met 
235 

Leu Gin 

Met Glu 

Tyr He 

Ala Gly 
300 

Asn Ser 
315 

Glu Ser 

Phe Tyr 

Phe Arg 

Asp He 
380 

Ser Ala 
395 

Tyr Met 
Thr Arg 
Arg Lys 

Trp He 
460 

Gly Lys 
475 

Thr Leu 

Pro Ser 

Asp Thr 

40 



205 

Asn Leu 

Glu Glu 

Lys He 

Tyr Gly 
270 

Asp Thr 
285 

Lys Lys 

Leu Gin 

Asp Leu 

Thr Leu 
350 

Leu Gly 
365 

Tyr Asp 

He Lys 

Lys Cys 

Glu Ala 
430 

Ala Trp 
445 

Ser Asn 

Val Ser 

Asn Ala 

Arg Phe 
510 

Arg Cys 



Tyr Arg 

Ala Glu 
240 

Pro Val 
255 

Trp His 

Leu Glu 

Leu Leu 

Gin Lys 
320 

Pro Lys 
335 

Ala Ser 

Phe Ala 

Thr Phe 

Arg Trp 
400 

Val Tyr 
415 

Glu Lys 

Glu Ala 

Gly Tyr 

Ser Ala 
480 

Trp Leu 
495 

Asn Asp 
Tyr Lys 
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515 

Ala Asp Arg Asp 
530 

Lys Asp Asn Pro 
545 

Ala Met Val Asn 



Ser Asn Asp Asn 

580 

Thr Arg Ala Leu 
595 

Ma As n Lys Glu 

610 

Met Leu Phe 

625 



520 

Arg Gly Glu Glu 
535 

Gly Ser Thr Glu 
550 

Asp He He Lys 
565 

He Pro Met Leu 



His His Leu Tyr 

600 

Thr Lys Lys Leu 
615 



Ala Ser Cys He 

540 

Glu Asp Ala Leu 
555 

Glu Leu Asn Trp 
570 

Ala Lys Lys His 
585 

He Tyr Arg Asp 



Val Met Glu Thr 

620 



525 

Ser Cys Tyr Met 



Asn His He Asn 

560 

Glu Leu Leu Arg 
575 

Ala Phe Asp He 
590 

Gly Phe Ser Val 
605 

Leu Leu Glu Ser 



<210> 33 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
oligonucleotide primer 2.2 BamHI 

<400> 33 

caaagggatc cagaatggct ctgg 



<210> 34 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
oligonucleotide primer 2.2 Not I 

<400> 34 

agtaagcggc cgctttttaa tcatacccac 



<210> 35 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
oligonucleotide primer 3.18 EcoRI 



<400> 35 



41 
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ctgcaggaat tcggcacgag c 

<210> 36 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 
oligonucleotide primer 3.18 Smal 

<400> 36 

catagccccg ggcatagatt tgagctg 



<210> 37 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 
oligonucleotide primer 10 Ndel 

<400> 37 

ggcaggaaca tatggctctc ctttctatcg 



<210> 38 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 
oligonucleotide primer 10 BamHI 

<400> 38 

tctagaacta gtggatcccc cgggctgcag 



<210> 39 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
oligonucleotide primer JB29 

<400> 39 

ctaccattcc aatatctg 18 



<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



42 
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<220> 

<223> Description of Artificial Sequence: PCR 
oligonucleotide primer 2-8 

<400> 40 

gttggatctt agaagttccc 



<210> 41 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
oligonucleotide primer 3-9 

<400> 41 

tttccattcc aacctctggg 



<210> 42 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
oligonucleotide primer 3-11 

<400> 42 

cgtaatggaa agctctggcg 



<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
oligonucleotide primer 7-1 

<400> 43 

ccttacacgc ctttggatgg 



<210> 44 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
olig 

onucleotide sequence 7—3 

<400> 44 

tctgttgatc caggatggtc 



43 
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<210> 


45 


<211> 


5 


<212> 


PRT 


<213> 


Arti 


<220> 




<223> 


Des c 




amin 


3 and 


4 r 


<400> 


45 


Asp As 


p Xa 


1 





Sequence 



Sequence : conserved 



<210> 46 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amino acid 
motif from which oligonucleotide primers can be 
synthesized that hybridize to the monoterpene 
Leu or V lIe a or S Vaf ^ P " Sent inventi ^n f wherein Xaa at position 4 represents 

<400> 46 

His Ser Asn Xaa Trp Asp Asp Asp 
1 5 



<210> 47 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amino acid 

motif from which degenerate oligonucleotides can 
be constructed that hybridize to the monoterpene 
synthases of the present invention 

<400> 47 

Ala Leu Asp Tyr Val Tyr 
1 5 



<21 


0> 


<21 


1> 


<21 


2> 


<21 


3> 


<22 


0> 


<22 


3> 



48 

7 

PRT 

Artificial Sequence 



Description of Artificial Sequence: amino acid 
motif from which degenerate oligonucleotide 
sequences can be constructed that hybridize to the 
monoterpene synthases of the present invention 



<400> 48 
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Glu Leu Ala Lys Leu Glu Phe 
1 5 

<210> 49 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amino acid 
motif from which degenerate oligonucleotide 
sequences can be constructed that hybridize to 
monoterpene synthase clones of the present 
invention 

<400> 49 

Arg Trp Trp Lys Glu Ser 
1 5 

<210> 50 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amino acid 

motxf from which oligonucleotide sequeences can be 
constructed that hybridize to monoterpene synthase 
or Ilfor'Lef ^ PrGSent invention ' wherein Xaa at position 1 repr 

<400> 50 

Xaa Leu Asp Asp Met Tyr Asp 



esents Val 



<210> 51 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amino acid 

motif from which oligonucleotide sequences can be 
constructed that hybridize to monoterpene synthase 
or Ile^oTLu* ^ PreSent invention wherein Xaa at position 1 reperesents Val 

<400> 51 

Xaa Leu Asp Asp Leu Tyr Asp 
1 5 

<210> 52 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: amino acid 

motif from which oligonucleotide sequences can be 
constructed that hybridize to the monoterpene 

synthase clones of the present invention, wherein Xaa at position 1 
represents Val or lie or Leu p 1 

<400> 52 

Xaa Leu Asp Asp lie Tyr Asp 
1 5 



<210> 53 
<211> 7 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: amino acid 

motif from which oligonucleotide sequences can be 
constructed that hybridize to the monoterpene 

repre^s^n^rHL 0 ' ^ P "" nt wh " ei » *« »t Po-ition 6 

<400> 53 

Cys Tyr Met Lys Asp Xaa Pro 
1 5 



<210> 54 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: exemplary 
oligonucleotide that corresponds to peptide 
sequence MetMetMet 

<400> 54 
atgatgatg 



<210> 55 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: exemplary 
oligonucleotide sequence that corresponds to 
peptide sequence MetMetMet 



<400> 55 
tactactac 



<210> 56 
<211> 9 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: exemplary 
oligonucleotide that corresponds to peptide 
sequence MetMetMet, n is inosine 

<400> 56 
nacnacnac 

9 



<210> 57 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide corresponding to amino acid 
sequence set forth in SEQ ID NO: 46 

<220> 

<221> misc feature 
<222> (1) .7(24) 

<223> Oligonucleotide that corresponds 
amino acid sequence set forth in 

<400> 57 

gtgtcgttgg agaccctgct gctg 



to the conserved 
SEQ ID NO:46 



<210> 58 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide sequence corresponding to amino 
acid sequence set forth in SEQ ID NO: 47 

<220> 

<221> misc__f eature 
<222> (1) . . (18) 

<223> Oligonucleotide corresponding to amino acid 
sequence set forth in SEQ ID NO: 47 

<400> 58 

cgggagctga tgcagatg 



<210> 59 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide that corresponds to amino acid 
sequence set forth in SEQ ID NO: 48 
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<220> 

<221> misc_feature 
<222> (1) . . (21) 

<223> Oligonucleotide that corresponds to conserved 
amino acid sequence set forth in SEQ ID NO: 48 

<400> 59 

ctcgagcggt tcgagctcaa g 21 



<210> 60 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide that corresponds to amino acid 
sequence set forth in SEQ ID NO: 49 

<220> 

<221> misc_f eature 
<222> (1) .7(18) 

<223> Oligonucleotide that corresponds to conserved 
amino acid sequence set forth in SEQ ID NO: 49 

<400> 60 

gccaccacct tcctctcg 18 



<210> 61 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide sequence corresponding to amino 
acid sequence set forth in SEQ ID NO: 50 

<220> 

<221> misc_f eature 
<222> (1) . . (21) 

<223> Oligonucleotide sequence corresponding to amino 
acid sequence set forth in SEQ ID NO: 50 

<400> 61 

gaggagctgc tgtacatgct g 21 



<210> 62 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide corresponding to amino acid 
sequence set forth in SEQ ID NO: 51 
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<220> 

<221> misc feature 
<222> (1) .7(21) 

<223> Oligonucleotide corresponding to conserved amino 
acxd sequence set forth in SEQ ID NO: 51 

<400> 62 

gaggagctgc tggagatgct g 



21 
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